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involved in non-physiological proliferative growth, and in particular involved in malignant or benign tumors, including 
atherosclerotic plaques. 

The term \vildtyp cell" is used to indicate the cell not harbouring an aberrant chromosome. "WiWtype" or "normal" 
chromosome refers to a non-aberrant chromosome. 

Diagnostic methods are based on the fact that an aberration in a chromosome results in a detectable alteration in 
the chromosomes' appearance or biochemical behaviour. A translocation, for example will result in a first part of the 
chromosome (and consequently of the MAG gene) having been substituted for another (second) part (further referred 
to as lirst and second substitution parts"). The first part will often appear someplace else on another chromosome from 
which the second part originates. As a consequence hybrids will be formed between the remaining parts of both (or in 
cases of triple translocations, even more) chromosomes and the substitution parts provided by their translocation part- 
ners. Since it has now been found that the breaks occur in a MAG gene this will result in hybrid gene products of that 
MAG gene. Markers, such as hybridising molecules like RNA, DNA or DNA/RNA hybrids, or antibodies will be able to 
detect such hybrids, both on the DNA level, and on the RNA or protein level. 

For example, the transcript of a hybrid will still comprise the region provided by the remaining part of the gene/chro- 
mosome but will miss the region provided by the substitution part that has been translocated. In the case of inversions, 
deletions and insertions the gene may be equally afflicted. 

Translocations are usually also cytogenetically detectable. The other aberrations are more difficult to find because 
they are often not visible on a cytogenetical level. The invention now provides possibilities for diagnosing all these types 
of chromosomal aberrations. 

In translocations markers or probes based on the MAG gene for the remaining and substitution parts of a chromo- 
some insity detect the remaining part on the original chromosome but the substitution part on another, the translocation 
partner. 

In the case of inversions for example, two probes will hybridise at a specific distance in the wildtype gene. This dis- 
tance might however change due to an inversion. In situ such inversion may thus be visualized by labeling a set of suit- 
able probes with the same or different detectable markers, such as f luorescent labels. Deletions and insertions may be 
detected in a similar manner. 

According to the invention the above in situ applications can very advantageously be performed by using FISH 
techniques. The markers are e.g. two cosmkte one of which comprises exons 1 to 3 of the MAG gene, while the other 
comprises exons 4 and 5. Both cosmids are labeled with different fluorescent markers, e.g. blue and yellow. The normal 
chromosome will show a combination of both labels, thus giving a green signal, while the translocation is visible as a 
blue signal on the remaining part of one chromosome (e.g. 1 2) while the yellow signal is found on another chromosome 
comprising the substitution part. In case the same labels are used for both probes, the intensity of the signal on the nor- 
mal chromosome will be 100%, while the signal on the aberrant chromosomes is 50%. In the case of inversions one of 
the signals shifts from one place on the normal chromosome to another on the aberrant one. 

In the above applications a reference must be included for comparison. Usually only one of the two chromosomes 
is afflicted. It will thus be very convenient to use the normal chromosome as an internal reference. Furthermore it is 
important to select one of the markers on the remaining or unchanging part of the chromosome and the other on the 
substitution or inverted part. In the case of the MAG gene of chromosome 12, breaks are usually found in the intron 
between exons 3 and 4 as is shown by the present invention. Probes based on exons 1 to 3 and 4 and 5 are thus very 
useful. As an alternative a combination of probes based on both translocation a fusion partners may be used. For 
example, for the identification of lipomas one may use probes based on exons 1 to 3 of the HMGI-C gene on the one 
hand and based on the LIM domains of the LPP gene on the other hand. 

"Probes" as used herein should be widely interpreted and include but are not limited to linear DNA or RNA strands. 
Yeast Artificial Chromosomes (YACs), or circular DNA forms, such as plasmids. phages, cosmids etc.. 

These in situ methods may be used on metaphase and interphase chromosomes. 

Besides the above described in situ methods various diagnostic techniques may be performed on a more biochem- 
ical level, for example based on alterations in the DNA, RNA or protein. Basis for these methods is the fact that by 
choosing suitable probes, variations in the length or composition in the gene, transcript or protein may be detected on 
a gel or Wot. Variations in length are visible because the normal gene, transcript(s) or proteins) will appear in another 
place on the gel or blot then the aberrant one(s). In case of a translocation more than the normal number of spots will 
appear. 

Based on the above principles the invention thus relates to a method of diagnosing cells having a non-physiological 
proliferative capacity, comprising the steps of taking a biopsy of the cells to be diagnosed, isolating a suitable MAG 
gene-related macromolecule therefrom, and analysing the macromolecule thus obtained by comparison with a refer- 
ence molecule originating from cells not showing a non-physiological proliferative capacity, preferably from the same 
individual. The MAG gene-related macromolecule may thus be a DNA, an RNA or a protein. The MAG gene may be 
either a member of the HMG family or of the LIM family. 

In a specific embodiment the diagnostic method of the invention comprises the steps of taking a biopsy of the cells 
to be diagnosed, extracting total RNA thereof, preparing a first strand cDNA of the mRNA species in the total RNA 
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extract or poly-A-selected fraction(s) thereof, which cDNA comprises a suitable tail; performing a PCR using a MAG 
gene specific primer and a tail-specific primer in order to amplify MAG gene specific cDNA's; separating the PCR prod- 
ucts on a gel to obtain a pattern of bands; evaluating the presence of aberrant bands by comparison to wildtype bands, 
preferably originating from the same individual. 

s As an alternative amplification may be performed by means of the Nucleic Acid Sequence-Based Amplification 
(NASBA) technique [81] or variations thereof. 

In another embodiment the method comprises the steps of taking a biopsy of the cells to be diagnosed, isolating 
total protein therefrom, separating the total protein on a gel to obtain essentially individual bands, optionally transfering 
the bands to a Western blot, hybridising the bands thus obtained with antibodies directed against a part of the protein 

10 encoded by the remaining part of the MAG gene and against a part of the protein encoded by the substitution part of 
the MAG gene; visualising the antigen-antibody reactions and establishing the presence of aberrant bands by compar- 
ison with bands from wildtype proteins, preferably originating from the same individual. 

In a further embodiment the method comprises taking a biopsy of the cells to be diagnosed; isolating total DNA 
therefrom; digesting the DNA with one or more so-called "rare cutter" (typically "6- or more cutters") restriction 

15 enzymes; separating the digest thus prepared on a gel to obtain a separation pattern; optionally transfering the sepa- 
ration pattern to a Southern blot; hybridising the separation pattern in the gel or on the blot with a set of probes under 
hybridising conditions; visualising the hybridisations and establishing the presence of aberrant bands by comparison to 
wildtype bands, preferably originating from the same individual. 

The diagnostic method of the invention may be used for diseases wherein cells having a non-physiological prolifer- 

20 ative capacity are selected from the group consisting of benign tumors, such as the mesenchymal tumors hamartomas 
(e.g. breast and lung), lipomas, pleomorphic salivary gland adenomas, uterine leiomyomas, angiomyxomas, fibroade- 
nomas of the breast, polyps of the endometrium, atherosclerotic plaques, and other benign tumors as well as various 
malignant tumors, including but not limited to sarcomas (e.g. rhabdomyosarcoma, osteosarcoma) and carcinomas (e.g. 
of breast lung, skin, thyroid). 

25 Recent publications indicate that atherosclerotic plaques also involve abnormal proliferation [26] and it was postu- 
lated that atherosclerotic plaques constitute benign tumors [27]. Therefore, this type of disorder is also to be understood 
as a possible indication for the use of the MAG gene family, in particular in diagnostic and therapeutic applications. 

It has been found that in certain malignant tumors the expression level of the HMG genes is increased [28]. Another 
aspect of the invention thus relates to the implementation of the identification of the MAG genes in therapy. The inven- 

30 tion for example provides anti-sense molecules or expression inhibitors of the MAG gene for use in the treatment of dis- 
eases involving cells having a non-physiological proliferative capacity by modulating the expression of the gene. 

The invention thus provides derivatives of the MAG gene for use in diagnosis and the preparation of therapeutical 
compositions, wherein the derivatives are selected from the group consisting of sense and anti-sense cDNA or frag- 
ments thereof, transcripts of the gene or fragments thereof, triple helix inducing molecule or other types of "transcription 

35 clamps", fragments of the gene or its complementary strand, proteins encoded by the gene or fragments thereof, pro- 
tein nucleic acids (PNA), antibodies directed to the gene, the cDNA, the transcript, the protein or the fragments thereof, 
as well as antibody fragments. 

It is to be understood that the principles of the present invention are described herein for illustration purposes only 
with reference to the HMGI-C gene mapping at chromosome 12 and the LPP gene on chromosome 3. Based on the 

40 information provided in this application the skilled person will be able to isolate and sequence corresponding genes of 
the gene family and apply the principles of this invention by using the gene and its sequence without departing from the 
scope of the general concept of this invention. 

The present invention will thus be further elucidated by the following examples which are in no way intended to limit 
the scope thereof. 

45 

EXAMPLES 
EXAMPLE 1 

so 1. Introduction 

This example describes the isolation and analysis of 75 overlapping YAC clones and the establishment of a YAC 
contig (set of overlapping clones), which spans about 6 Mb of genomic DNA around locus D12S8 and includes MAR. 
The orientation of the YAC contig on the long arm of chromosome 12 was determined by double-color FISH analysis. 
55 On the basis of STS-content mapping and restriction enzyme analysis, a long range physical map of this 6 Mb DNA 
region was established. The contig represents a useful resource for cDNA capture aimed at identifying genes located 
in 12q15, including the one directly affected by th various chromosome 12 aberrations. 
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2. Materials and method^ 

2.1. Cell lines 

Cell lines PK89-12 and US-3/SV40/A9-B4 were used for Chromosome Assignment using Somatic cell Hybrids 
(CASH) experiments. PK89-12, which contains chromosome 12 as the sole human chromosome in a hamster genetic 
background, has been described before [29]. PK89-12 cells were grown in DME-F12 medium supplemented with 10% 
fetal bovine serum, 200 lU/ml penicillin, and 200 jig/ml streptomycin. Somatic cell hybrid LIS-3/SV40/A9-B4 was 
obtained upon fusion of myxoid liposarcoma ceil line LIS-3/SV40. which carries a t(12;16)(q13;p1 1.2), and mouse A9 
cells and was previously shown to contain der(16), but neither der(12) nor the normal chromosome 12 [30]. LIS- 
3/SV40/A9-B4 cells were grown in selective AOA-medium (AOA-medium which consisted of DME-F12 medium supple- 
mented with 10% fetal bovine serum, 0.05 mM adenine, 0.05 mM ouabain, and 0.01 mM azaserine). Both cell lines 
were frequently assayed by standard cytogenetic techniques. 

2.2. Nucleotide sequence analysis and oligonucleotides. 

Nucleotide sequences were determined according to the dideoxy chain termination method using a T7 polymerase 
sequencing kit (Pharmacia/LKB) or a dsDNA Cycle Sequencing System (GIBCO/BRL). DNA fragments were sub- 
cloned in pGEM-3Zf(+) and sequenced using FITC-labelled standard SP6 or T7 primers, or specific primers synthe- 
sized based upon newly obtained sequences. Sequencing results were obtained using an Automated Laser 
Fluorescent (A.L.F.) DNA sequencer (Pharmacia Biotech) and standard 30 cm, 6% Hydrolink R , Long Range™ gels (AT 
Biochem). The nucleotide sequences were analyzed using the sequence analysis software Genepro (Riverside Scien- 
tific), PC/Gene (IntelliGenetics), the IntelliGenetics Suite software package (IntelliGenetics, Inc.), and Oligo [31]. All oli- 
gonucleotides were purchased from Pharmacia Biotech. 

2.3. Chromosome preparations and fluorescence in situ hybridization (FISH) 

FISH analysis of YAC clones was performed to establish their chromosomal positions and to identify chimeric 
clones. FISH analysis of cosmid clones corresponding to STSs of YAC insert ends were performed to establish their 
chromosomal positions. Cosmids were isolated from human genomic library CMLW-25383 [32] or the arrayed chromo- 
some 12-specific library constructed at Lawrence Livermore National Laboratory (LL12NC01, ref. 33) according to 
standard procedures [34]. Routine FISH analysis was performed essentially as described before [30, 35]. DNA was 
labelled with biotin-11<JUTP (Boehringer) using the protocol of Kievits et al. [36]. Antifade medium, consisting of 
DABCO (2 g/100 ml, Sigma), 0.1 M Tris-HCL pH 8, 0.02% Thimerosal, and glycerol (90%), and containing propidium 
iodide (0.5 ng/ml, Sigma) as a counterstain, was added 15 min before specimens were analyzed on a Zeiss Axiophot 
fluorescence microscope using a double band-pass filter for FITCHexas red (Omega Optical, Inc.). Results were 
recorded on Scotch (3M) 640 ASA film. 

For the double colour FISH experiments, LLNL12NC01-96C11 was labelled with digoxygenin-11-dUTP (Boe- 
hringer) and cosmids LLNL12NCQMF6 and -193F10, with biotin-11-dUTP. Equal amounts of each probe were com- 
bined and this mixture was used for hybridization. After hybridization, slides were incubated for 20 min with Avidin-FITC 
and then washed as described by Kievrts et al. [36]. Subsequent series of incubations in TNB buffer (0.1 M Tris-HCI pH 
7.5. 0. 1 5 M NaCI, 0.5% Boehringer blocking agent (Boehringer)) and washing steps were performed in TNT buffer (0.1 
M Tris-HCI pH 7.5, 0.15 M NaCI, 0.05% Tween-20); all incubations were performed at 37 °C for 30 min. During the sec- 
ond incubation, Goat-a-Avidin-biotin (Vector) and Mouse-cKJigoxygenin (Sigma) were applied simultaneously. During 
the third incubation, Avidin-FITC and Rabbit-a-Mouse-TRITC (Sigma) were applied. During the last incubation, Goat- 
a-Rabbit-TRITC (Sigma) was applied. After a last wash in TNT buffer, samples were washed twice in 1 x PBS and then 
dehydrated through an ethanol series (70%, 90%, 100%). Antifade medium containing 75 ng^tl DAPI (Serva) as coun- 
terstain was used. Specimens were analyzed on a Zeiss Axiophot fluorescence microscope as described above. 

2.4. Screening of YAC libraries. 

YAC clones were isolated from CEPH human genomic YAC libraries mark 1 and 3 [37, 38] made available to us by 
the Centre d'£tude du Polyphormisme Humain (CEPH). Screening was carried out as previously described [39]. Con- 
taminating Candida parapsvlosis. which was sometimes encountered, was eradicated by adding terbinafin (kindly sup- 
plied by Dr. Dieter ROmer, Sandoz Pharma LTD, Basle, Switzerland) to the growth medium (final concentration: 25 
l*g/ml). The isolated YAC clones were characterized by STS-content mapping, contour-clamped homogeneous electric 
field (CHEF) gel lectrophoresis [40], restriction mapping, and hybridization- and FISH analysis. 
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2.5. PGR reactions 

PCR amplification was carried out using a Pharmacia/LKB Gene ATAQ Controllef (Pharmacia/LKB) in final vol- 
umes of 100 pJ containing 10 mM Tris-HCI pH 8.3, 50 mM KCI. 1 .5 mM MgCI 2 . 0.01% gelatine, 2 mM dNTP's, 20 pmole 
of each amplimer, 2.5 units of Amplitaq (Perkin-Elmer Cetus), and 100 ng (for superpools) or 20 ng (for pods) of DNA. 
After initial denaturation for 5 min at 94 °C, 35 amplification cycles were performed each consisting of denaturation for 
1 min at 94 °C, annealing for 1 min at the appropriate temperature (see Table 0 and extension for 1 min at 72 °C. The 
PCR reaction was completed by a final extension at 72 °C for 5 min. Results were evaluated by analysis of 10 (il of the 
reaction product on pofyacrylamide minigels. 

2.6. Pulsed-field gel electrophoresis and Southern blot analysis 

Pulsed-f ield gel electrophoresis and Southern Wot analysis were performed exactly as described by Schoenmakers 
et al. [39]. Agarose plugs containing high-molecular weight YAC DNA (equivalent to about 1 x 10 8 yeast cells) were 
twice equilibrated in approximately 25 mi TE buffer (pH 8.0) for 30 min at 50 °C followed by two similar rounds of equi- 
libration at room temperature. Plugs were subsequently transferred to round-bottom 2 ml eppendorf tubes and equili- 
brated two times for 30 min in 500 \i\ of the appropriate 1 x restriction-buffer at the appropriate restriction temperature. 
Thereafter, DNA was digested in the plugs accordng to the suppliers (Boehringer) instructions for 4 h using 30 units of 
restriction endonuclease per digestion reaction. After digestion, plugs along with appropriate molecular weight markers 
were loaded onto a 1% agarose /0.25 x TBE gel, sealed with LMP-agarose and size fractionated on a CHEF apparatus 
(Biorad) for 18 h at 6.0 V/cm using a pulse angle of 120 degrees and constant pulse times varying from 10 sec (sepa- 
ration up to 300 kbp) to 20 sec (separation up to 500 kbp). In the case of large restriction fragments, additional runs 
were performed, aiming at the separation of fragments with sizes above 500 kbp. Electrophoresis was performed at 14 
°C in 0.25 x TBE. As molecular weight markers, lambda ladders (Promega) and home-made plugs containing lambda 
DNA cut with restriction endonuclease Hindlll were used. After electrophoresis, gels were stained with ethidium bro- 
mide, photographed, and UV irradiated using a stratalinker (Stratagene) set at 120 mJ. DNA was subsequently blotted 
onto Hybond N + membranes (Amersham) for 4-16 h using 0.4 N NaOH as transfer buffer. After blotting, the membranes 
were dried for 15 min at 80 °C, briefly neutralised in 2 x SSPE, and prehybridised for at least 3 h at 42 °C in 50 ml of a 
solution consisting of 50% formamide, 5 x SSPE, 5 x Denhardts, 0.1% SDS and 200 ^g/ml heparin. Filters were sub- 
sequently hybridised for 16 h at 42 °C in 10 ml of a solution consisting of 50% formamide, 5 x SSPE, 1 x Denhardts, 
0.1% SDS, 100 fig/ml heparin, 0.5% dextran sulphate and 2-3 x 10 6 cpm/ml of labelled probe. Thereafter, membranes 
were first washed two times for 5 min in 2 x SSPE/0.1% SDS at room temperature, then for 30 min in 2 x SSPE/0.1% 
SDS at 42 °C and, finally, in 0.1 x SSPE/0.1% SDS for 20 min at 65 °C. Kodak XAR-5 films were exposed at - 80 °C for 
3-16 h, depending on probe performance. Intensifying screens (Kyokko special 500) were used. 

2.7. Generation of STSs from YAC insert ends 

STSs from YAC insert ends were obtained using a vectorette-PCR procedure in combination with direct DNA 
sequencing analysis, essentially as described by Geurts et al. [41]. Primer sets were developed and tested on human 
genomic DNA, basically according to procedures described above. STSs will be referred to throughout this application 
by their abbreviated names (for instance: RM1 instead of STS 12-RM1) for reasons of convenience. 

3. Resets 

3.1 . Assembly of a YAC Contig around locus D12S8 

In previous studies [39], a 800 kb YAC contig around D12S8 was described. This contig consisted of the following 
three partially overlapping, non-chimeric CEPH YAC clones: 258F11, 320F6, and 234G11. This contig was used as 
starting point for a chromosome walking project to define the DNA region on the long arm of chromosome 12 that 
encompasses the breakpoints of a variety of benign solid tumors, which are all located proximal to D12S8 and distal to 
CHOP. Initially, chromosome walking was performed bidirectionally until the size of the contig allowed reliable determi- 
nation of the orientation of it. In the bidirectional and subsequent unidirectional chromosome walking steps, the follow- 
ing general procedure was used. First, rescuing and sequencing the ends of YAC clones resulted in DNA markers 
characterizing the left and right sides of these (Table I). Based on sequence data of the ends of forty YAC inserts, primer 
sets were developed for specific amplification of DNA; establishing STSs (Table II). Their localization to 12q13-qter was 
determined by CASH as well as FISH after corresponding cosmid clones were isolated. It should be noted that isolated 
YAC clon s were often evaluated by FISH analysis too, thus not only rev aling the chromosomal origin of their inserts 
but also, for a number of cases, establishing and defining their chim ric natur . Moreover, it should be emphasized that 
data obtained by restriction ndonuclease analysis of overlapping YAC clones wer also taken into account in the YAC 
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clone evaluation and subsequent alignment With the sequentially selected and evaluated primer sets, screening of the 
YAC and cosmid libraries was performed to isolate the building blocks for corrtig-assemWy. Therefore, corrtig-ass mWy 
was performed using data derived from FISH- and STS^XKTtent mapping as well as restriction endonuclease analysis 
Using this approach, we established a YAC contig consisting of 75 overlapping YAC clones, covering approximately 6 
Mb of DNA (Rg. 1). This contig appeared to encompass the chromosome 12 breakpoints of ail tumor-derived cell lines 
studied [39]. Characteristics of the YACs that were used to build this contig are given in Table I. 

3.2. Establishment of the Chromosomal Orientation of the YAC Contig 

To allow unidirectional chromosome walking towards the centromere of chromosome 12, the orientation of the DNA 
region flanked by STSs RM14 and RM26 (approximate size: 1450 kb) was determined by double-color interphase FISH 
analysis. Cosmid clones corresponding to these STSs (i.e. LL12NC01-1F6 (RM14) and LL12NC01-96C11 (RM26)) 
were differentially labelled to show green or red signals, respectively. As a reference locus, cosmid LL12NC0M93F10 
was labelled to show green signals upon detection. LL12NC0M93F10 had previously been mapped distal to the 
breakpoint of LIS-3/SV40 (i.e. CHOP) and proximal to the chromosome 12q breakpoints in lipoma cell line U-14/SV40 
and uterine leiomyoma cell line LM-30. 1/SV40. LL12NC01 -1 F6 and LL1 2NC01 -96C1 1 were found to be mapping distal 
to the 12q breakpoints in lipoma cell line LM4/SV40 and uterine leiomyoma cell line LM-30. 1/SV40. Therefore, 
LL12NC01-193F10 was concluded to be mapping proximal to both RM14 and RM26 (unpublished results). Of 150 
informative interphases scored. 18% showed a signal-order of red-green-green whereas 72% showed a signal order of 
green-red-green. Based upon these observations, we concluded that RM26 mapped proximal to RM14, and therefore 
we continued to extend the YAC contig from the RM26 (i.e. proximal) side of our contig only. Only the cosmids contain- 
ing RM14 and RM26 were ordered by double-color interphase mapping; the order of all others was deduced from data 
of the YAC contig. Finally, it should be noted that the chromosomal orientation of the contig as proposed on the basis 
of results of the double-color interphase FISH studies was independently confirmed after the YAC contig had been 
extended across the chromosome 12 breakpoints as present in a variety of tumor cell lines. This confirmatory informa- 
tion was obtained in extensive FISH studies in which the positions of YAC and cosmid clones were determined relative 
to the chromosome 12q13<|15 breakpoints of primary lipomas, uterine leiomyomas, pleomorphic salivary gland ade- 
nomas, and pulmonary chondroid hamartomas or derivative cell lines [24. 42, 25, 43]. 

3.3. Construction of a Rare-Cutter Physical Map from the 6 Mb YAC Contig around D12S8 

Southern Wots of total yeast plus YAC DNA. digested to completion with rare-cutter enzymes (see Materials and 
Methods) and separated on CHEF gels, were hybridized sequentially with i) the STS used for the initial screening of the 
YAC in question, ii) pYAC4 right arm sequences. Hi) pYAC4 left arm sequences, and iv) a human ALU-repeat probe 
(BLUR-8). The long-range restriction map that was obtained in this way. was completed by probing with PCR-isolated 
STSs/YAC end probes. Occasionally double-digests were performed. 

Restriction maps of individual YAC clones were aligned and a consensus restriction map was established. It is 
important to note here that the entire consensus rare-cutter map was supported by at least two independent clones 
showing a full internal consistency. 

3.4. Physical mapping of CA repeats and monomorphic STSs/ESTs 

Based upon integrated mapping data as emerged from the Second International Workshop on Human Chromo- 
some 12 [44]. a number of published markers was expected to be mapping within the YAC contig presented here. To 
allow full integration of our mapping data with those obtained by others, a number of markers were STS content- 
mapped on our contig. and the ones found positive were subsequently sublocalized by (primer-)hybridization on YAC 
Southern Wots. Among the markers that were found to reside within the contig presented here were CA repeats 
D12S313 (AFM207xf2) and D12S335 (AFM273vg9) [45], D12S375 (CHLC GATA3F02), and D12S56 [46]. Further- 
more, the interferon gamma gene (IFNG) [47], the ras-related protein gene Rap1 B [48], and expressed sequence tag 
EST01096 [49] were mapped using primer sets which we developed based on puWicly available sequence data (see 
TaWe II). Markers which were tested and found negative included D12S80 (AFM102xd6), D12S92 (AFM203va7) 
D12S329 (AFM249xh9) and D12S344 (AFM296xd9). 

4. Discussion 

In the present example the estaWishmerrt of a YAC contig and rare-cutter physical map covering approximately 6 
Mb on 12q15. a region on the long arm of human chromosome 12 containing MAR in which a number of recurrent chro- 
mosomal aberrations of benign solid tumors are known to be mapping was illustrated. 
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The extent of overlap between individual YACs has been carefully determined, placing the total length of the contig 
at approximately 6 Mb (Fig. 1). It should be noted that our sizing-data for some of the YAC clones differ slightly from th 
sizes determined by CEPH [50]. It is our belief that this is most probably due to differences in the parameters for running 
the pulsed-f ield gels in the different laboratories. 

5 Using restriction mapping and STS-content analysis, a consensus long range physical map (Fig. 1) was con- 
structed. The entire composite map is supported by at least two-fold coverage. In total over 30 Mb of YAC DNA was 
characterized by restriction and STS content analysis, corresponding to an average contig coverage of about 5 times. 
Although the "inborn" limited resolution associated with the technique of pulsed-fiekJ electrophoresis does not allow 
very precise size estimations, comparison to restriction mapping data obtained from a 500 Id) cosmid contig contained 

10 within the YAC contig presented here showed a remarkable good correlation. Extrapolating from the cosmid data, we 
estimate the accuracy of the rare-cutter physical map presented here at about 10 Kb. 

The results of our physical mapping studies allowed integration of three gene-specific as well as five anonymous 
markers isolated by others (indicated in between arrows in Fig. 1). The anonymous markers include one monomorphic 
and four polymorphic markers. Five previously published YAC-end-derived single copy STSs (RM1, RM4, RMS. RM7, 

75 and RM21) as well as four published CA repeats (D12S56, D12S313, D12S335, and D12S375) and three published 
gene-associated STSs/ESTs (RAP1 B, EST01096, and IFNG) have been placed on the same physical map and this will 
facilitate (linkage-) mapping and identification of a number of traits/disease genes that map in the region. Furthermore, 
we were able to place onto the same physical map, seventy two YAC-end-derived (Table I) and eight cosmid -end- or 
inter-ALU-derived DNA markers (CH9, RM1, RM110, RM111, RM130, RM131, RM132. and RM133). which were 

20 developed during the process of chromosome walking. The PYTHIA automatic mail server at PYTHIA@anl.gov was 
used to screen the derived sequences of these DNA markers for the presence of repeats. For forty three of these sev- 
enty two DNA markers (listed in Table II), primer sets were developed and the corresponding STSs were determined to 
be single copy by PCR as well as Southern blot analysis of human genomic DNA. TTie twenty nine remaining DNA 
markers (depicted in the yellow boxes) represent YAC-end-derived sequences for which we did not develop primer sets. 

25 These YAC-end sequences are assumed to be mapping to chromosome 12 on the basis of restriction mapping. The 
final picture reveals an overall marker density in this region of approximately one within every 70 kb. 

The analysis of the contig presented here revealed many CpG-rich regions, potentially HTF islands, which are 
known to be frequently associated with housekeeping genes. These CpG islands are most probably located at the 5' 
ends of as yet unidentified genes: it has been shown that in 90% of cases in which three or more rare-cutter restriction 

30 sites coincide in YAC DNA there is an associated gene [51]. This is likely to be an underestimate of the number of genes 
yet to be identified in this region because 60% of tissue-specific genes are not associated with CpG islands [52] and 
also because it is possible for two genes to be transcribed in different orientations from a single island [53]. 

While several of the YAC clones that were isolated from the CEPH YAC library mark 1 were found to be chimeric, 
overlapping YAC clones that appeared to be non-chimeric based on FISH, restriction mapping and STS content analy- 

35 sis could be obtained in each screening, which is in agreement with the reported complexity of the library. The degree 
of chimerism fa the CEPH YAC library mark 1 was determined at 18% (12 out of 68) for the region under investigation 
here. The small number of YACs from the CEPH YAC library mark 3 (only 7 MEGA YACs were included in this study) 
did not allow a reliable estimation of the percentage of chimeric clones present in this library. The average size of YACs 
derived from the mark 1 library was calculated to be 381 kb ; non-chimeric YACs (n=58) had an average size of 366 kb 

40 while chimeric YACs (n=12) were found to have a considerable larger average size; i.e. 454 kb. 

In summary, we present a 6 Mb YAC contig corresponding to a human chromosomal region which is frequently 
rearranged in a variety of benign solid tumors. The contig links over 84 loci, including 3 gene-associated STSs. More- 
over, by restriction mapping we have identified at least 12 CpG islands which might be indicative for genes residing 
there. Finally, four CA repeats have been sublocalized within the contig. The integration of the genetic, physical, and 

45 transcriptional maps of the region provides a basic framework for further studies of this region of chromosome 1 2. Initial 
studies are likely to focus on MAR and ULCR12, as these regions contain the breakpoint cluster regions of at least three 
distinct types of solid tumors. The various YAC clones we describe here are valuable resources for such studies. They 
should facilitate the search for genes residing in this area and the identification of those directly affected by the chromo- 
some 12q aberrations of the various benign solid tumors. 

50 

EXAMPLE 2 

1. Introduction 

55 It was found that the 1 .7 Mb Multiple Aberration Region on human chromosome 1 2q1 5 harbors recurrent chromo- 
some 12 breakpoints frequently found in different benign solid tumor types. In this example the identification of an HMG 
gene within MAR that appears to be of pathogenetical relevance is described. Using a positional cloning approach, the 
High Mobility Group protein gene HMGI-C was identified within a 1 75 kb segment of MAR and its genomic organization 
characterized. By FISH, within this gene the majority of the breakpoints of seven different benign solid tumor types were 
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pinpointed. By Southern blot and 3'-RACE analysis, consistent rearrangements in HMGI-C and/or expression of altered 
HMGI-C transcripts were demonstrated. These results indicate a link between a member of the HMG gene family and 
benign solid tumor development. 

5 3, Materials and methods 

2.1. Cell culture and primary tumor specimens. 

Tumor cell lines listed in Fig. 3 were established by a transfection procedure [54] and have been described before 
10 in [39, 24J and in an article of Van de Ven et al. t Genes Chromosom. Cancer 12, 296-303 (1995) enclosed with this 
application as ANNEX 1. Cells were grown in TC199 medium supplemented with 20% fetal bovine serum and were 
assayed by standard cytogenetic techniques at regular intervals. The human hepatocellular carcinoma celt lines Hep 
3B and Hep G2 were obtained from the ATCC (accession numbers ATCC HB 8064 and ATCC HB 8065) and cultured 
in DMEM/F12 supplemented with 4% Ultroser (Gibcc/BRL). Primary solid tumors were obtained from various University 
75 Clinics. 

2.2. YAC and cosmid clones 

YAC clones were isolated from the CEPH mark 1 [57] and mark 3 [58] YAC libraries using a combination of PCR- 
20 based screening [59] and colony hybridization analysis. Cosmid clones were isolated from an arrayed human chromo- 
some 12-specrfic cosmid library (LL12NC01) [60] obtained from Lawrence Liver more National Laboratory (courtesy P. 
de Jong). LL1 2NC01 -derived cosmid clones are indicated by their microtiter plate addresses; i.e. for instance 27E1 2. 

Cosmid DNA was extracted using standard techniques involving purification over Qiagen tips (Diagen). Agarose 
plugs containing high-molecular weight yeast + YAC DNA (equivalent to 1 x 10 9 cells ml" 1 ) were prepared as described 
25 before [61]. Plugs were thoroughly dialysed against four changes of 25 ml T 10 E.| (pH 8.0) followed by two changes of 
0.5 ml 1 x restriction buffer before they were subjected to either pulsed-f ieid restriction enzyme mapping or YAC-end 
rescue. YAC-end rescue was performed using a vectorette-PCR procedure in combination with direct solid phase DNA 
sequencing, as described before in reference 61. Inter-Alu PCR products were isolated using published oligonucle- 
otides TC65 or 517 [62] to which Sail-tails were added to facilitate cloning. After sequence analysis, primer pairs were 
30 developed using the OLIGO computer algorithm [61]. 

2.3. DNA labelling 

DNA from YACs, cosmids, PCR products and oligonucleotides was labelled using a variety of techniques. For 
35 FISH, cosmid clones or inter-Alu PCR products of YACs were biotinylated with biotin-11-dUTP (Boehringer) by nick 
translation. For filter hybridizations, probes were radio-labelled with a- 32 P-dCTP using random hexamers [62]. In case 
of PCR-products smaller than 200 bp in size, a similar protocol was applied, but specific oligonucleotides were used to 
prime labelling reactions. Oligonucleotides were labelled using y^P-ATP. 

40 2.4. Nucleotide sequence analysis and PCR amplification 

Nucleotide sequences were determined as described in Example 1 . Sequencing results were analyzed using an 
A.LF. DNA sequencer™ (Pharmacia Biotech) on standard 30 cm, 6% Hydrolink", Long Range™ gels (AT Biochem). 
PCR amplifications were carried out essentially as described before [39]. 

45 

2.5. Rapid amplification of cDNA ends (RACE) 

Rapid amplification of 3' cDNA-ends (3'- RACE) was performed using a slight modification of part of the 
GIBCO/BRL 3'-ET protocol. For first strand cDNA synthesis, adapter primer (AP2) AAG GAT CCG TCG ACA TC(T) 17 

so was used. For both initial and secondary rounds of PCR, the universal amplification primer (UAP2) CUA CUA CUA CUA 
AAG GAT CCG TCG ACA TC was used as "reversed primer". In the first PCR round the following specific "forward prim- 
ers" were used: i) 5'-CTT CAG CCC AGG GAC AAC-3' (exon 1). ii) 5'-CAA GAG GCA GAC CTA GGA-3' (exon 3), or iii) 
5'-AAC AAT GCA ACT TTT AAT TAC TG-3' (3*-UTR). In the second PCR round the following specific forward primers 
(nested primers as compared to those used in the first round) were used: i) 5*-CAU CAU CAU CAU CGC CTC AGA AGA 

55 GAG GAC-3' (exon 1), ii) 5'-CAU CAU CAU CAU GTT CAG AAG AAG CCT GCT-3' (exon 4), or iii) 5'-CAU CAU CAU 
CAU TTG ATC TGA TAA GCA AGA GTG GG-3' (3'-UTR). CUA/CAU-tailing of the nested, specific primers allowed the 
use of the directional CloneAmp cloning system (GIBCO/BRL). 
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3. Results 

3.1. Development of cosmid contig and STS map of MAR segment 

5 During the course of a positional cloning effort focusing on the long arm of human chromosome 1 2, we constructed 
a yeast artificial chromosome (YAC) contig spanning about 6 Mb and consisting of 75 overlapping YACs. For a descrip- 
tion thereof reference is made to Example 1 . This contig encompasses MAR (see also Fig. 2), in which most of the chro- 
mosome I2ql3<i15 breakpoints as present in a variety of primary benign solid tumors (34 tumors of eight different 
types tested sofar; Table 5) and tumor cell lines (26 tested sofar, derived from lipoma, uterine leiomyoma, and pleomor- 

10 phic salivary gland adenoma; Fig. 3) appear to cluster. We have developed both a long-range STS and rare cutter phys- 
ical map of MAR and found, by FISH analysis, most of the breakpoints mapping within the 445 kb subregion of MAR 
located between STSs RM33 and RM98 (see Fig. 2 and 3). FISH experiments, including extensive quality control, were 
performed according to routine procedures as described before [25, 39, 24, 42, 36] To further refine the distribution of 
breakpoints within this 445 kb MAR segment, a cosmid contig consisting of 54 overlapping cosmid clones has been 

is developed and a dense STS map (Fig. 2) established. The cosmid contig was double-checked by comparison to the 
rare cutter physical map and by STS content mapping. 

3.2. Clustering of the chromosome 1 2q breakpoints within a 1 75 kb DNA segment of MAR 

20 The chromosome 12q breakpoints in the various tumor cell lines studied was pinpointed within the cosmid contig 
by FISH (Fig. 3). As part of our quality control FISH experiments [25, 39, 24, 42], selected cosmids were first tested on 
metaphase spreads derived from normal lymphocytes. FISH results indicated that the majority (at least 18 out of the 26 
cases) of the chromosome 12 breakpoints in these tumor cell lines were found to be clustering within the 1 75 to DNA 
interval between RM99 and RM133, indicating this interval to constitute the main breakpoint cluster region. FISH results 

25 obtained with U-501/SV40 indicated that part of MAR was translocated to an apparently normal chromosome 3; a chro- 
mosome aberration overseen by applied cytogenetics. Of interest to note, finally, is the fact that the breakpoints of uter- 
ine leiomyoma cell lines LM-5.1/SV40, LM-65/SV40. and LM-608/SV40 were found to be mapping within the same 
cosmid clone; i.e. cosmid 27E12. 

We also performed FISH experiments on eight different types of primary benign solid tumors with chromosome 

30 12q13-q15 aberrations (Table 4). A mixture of cosmid clones 27E12 and 142H1 was used as molecular probe. In sum- 
mary, the results of the FISH studies of primary tumors were consistent with those obtained for the tumor cell lines. The 
observation that breakpoints of each of the seven different tumor types tested were found within the same 1 75 kb DNA 
interval of MAR suggested that this interval is critically relevant to the development of these tumors and, therefore, 
might harbor the putative MAG locus or critical part(s) of it. 

35 

3.3. Identification of candidate genes mapping within MAR 

In an attempt to identify candidate genes mapping within the 1 75 kb subregion of MAR between STSs RM99 and 
RM133. we used 3'-terminal exon trapping and genomic sequence sampling (GSS) [63]. Using the GSS approach, we 

40 obtained DNA sequence data of the termini of a 4.9 kb BamHI subfragment of cosmid 27E12, which was shown by 
FISH analysis to be split by the chromosome 1 2 aberrations in three of the uterine leiomyoma cell lines tested. A BLAST 
[64] search revealed that part of these sequences displayed sequence identity with a publicly available partial cDNA 
sequence (EMBL accession # Z31 595) of the high mobility group (HMG) protein gene HMGI-C [65], which is a member 
of the HMG gene family [66]. In light of these observations, HMGI-C was considered a candidate MAG gene and stud- 

45 ied in further detail. 

3.4. Genomic organization of HMGI-C and rearrangements in benign solid tumors 

Since only 1200 nucleotides of the HMGI-C transcript (reported size approximately 4 kb [65, 67]) were publicly 
so available, we first determined most of the remaining nucleotide sequences of the HMGI-C transcript (Gen Bank, # 
U28749). This allowed us to subsequently establish the genomic organization of the gene. Of interest to note about the 
sequence data is that a CT-repeat is present in the 5'-UTR of HMGI-C and a GGGGT-pentanucleotide repeat in the 3'- 
UTR, which might be of regulatory relevance. Comparison of transcribed to genomic DNA sequences (GenBank, # 
U28750, U28751 , U28752, U28753. and U28754) of the gene revealed that HMGI-C contains at least 5 exons (Fig. 2). 
55 Transcriptional orientation of the gene is directed towards the telomere of the long arm of the chromosome. Each of the 
first three exons encode a putative DNA binding domain (DBD), and exon 5 encodes an acidic domain, which is sepa- 
rated from the three DBDs by a spacer domain encoded by exon 4. The three DBD-encoding exons are positioned rel- 
atively close together and are separated by a large intron of about 140 kb from the two other exons. which in turn are 
separated about 1 1 kb from each other. Of particular interest to emphasize here is that th f iv exons are dispersed 
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over a genomic region of at least 160 kb. thus almost covering the entire 175 kb main MAR breakpoint cluster region 
descrfoed above. Results of molecular cytogenetic studies, using a mixture of cosmids 142H1 (containing exons 1-3) 
and 27E12 (containing exons 4 and 5) as molecular probe, clearly demonstrate that the HMGI-C gene is directly 
affected by the observed chromosome 12 aberrations in the majority of the tumors and tumor cell lines that were eval- 
uated (Fig. 3; Table 4). These cytogenetic observations were independently confirmed by Southern Wot analysis in the 
case of LM-608/SV40 (results not shown) LM-30.1/SV40 [24], and Ad-312/SV40; probes used included CH76, RM1 18- 
A, and EM26. The failure to detect the breakpoints of LM-65/SV40, LM-609/SV40, Ad-21 1/SV40, Ad-263/SV40 Ad- 
302/SV40. LM4/SV40, and IJ-538/SV40 with any of these three probes was also consistent with the FISH data estab- 
hshing the relative positions of the breakpoints in MAR (cf. Fig. 3). These results made HMGI-C a prime candidate to 
be the postulated MAG gene. 

3.5. Expression of aberrant HMGI-C transcripts in benign solid tumor cells. 

In the context of follow-up studies, it was of interest to test for possible aberrant HMGI-C expression. Initial Northern 
blot studies revealed that transcripts of HMGI-C could not be detected in a variety of normal tissues (brain, heart, lung, 
liver, kidney, pancreas, placenta, skeletal muscle) tested as well as in a number of the tumor cell lines listed in Fig. 3 
(data not shown). It is known that HMGI-C mRNA levels in normal differentiated tissues are very much lower than in 
malignant tissues [65, 67], As a control in our Northern studies, we included hepatoma cell line Hep 3B, which is known 
to express relatively high levels of HMGI-C. We readily detected two major HMGI-C transcripts, approximately 3.6 and 
3.2 kb in size; with the differences in molecular weight most likely due to differences in their 5*-noncoding regions. In an 
alternative and more sensitive approach to detect HMGI-C or 3*-aberrant HMGI-C transcripts, we have performed 3'- 
RACE experiments. In control experiments using a number of tissues with varying HMGI-C transcription levels (high lev- 
els in Hep 3B hepatoma cells, intermediate in Hep G2 hepatoma cells, and low in myometrium, normal fet tissue, and 
pseudomyxoma), we obtained 3'-RACE clones which, upon molecular cloning and nucleotide sequence analysis, 
appeared to represent perfect partial cDNA copies of 3'-HMGI-C mRNA sequences; no matter which of the three 
selected primer sets was used (see Methodology). RACE products most likely corresponding to cryptic or aberrantly 
spliced HMGI-C transcripts were occasionally observed; their ectopic sequences were mapped back to HMGI-C intron 
3 or 4. 

In similar 3'-RACE analysis of ten different primary tumors or tumor cell lines derived from lipoma, uterine leiomy- 
oma, and pleomorphic salivary gland adenoma, we detected both constant and unique PCR products. The constant 
PCR products appeared to represent, in most cases, perfect partial cDNA copies of 3'-HMGI-C mRNA sequences. 
They most likely originated from a presumably unaffected HMGI-C allele and might be considered as internal controls. 
The unique PCR products of the ten tumor cell samples presented here appeared to contain ectopic sequences fused 
to HMGI-C sequences. In most cases, the ectopic sequences were found to be derived from the established transloca- 
tion partners, thus providing independent evidence for translocation-induced rearrangements of the HMGI-C gene. 
Information concerning nucleotide sequences, diversion points, and chromosomal origins of the ectopic sequences of 
these RACE products is summarized in Table 5. It should be noted that chromosomal origins of ectopic sequences was 
established by CASH (Chromosome Assignment using Somatic cell Hybrids) analysis using the NIGMS Human/Rodent 
Somatic Hybrid Mapping Panel 2 obtained from the Coriell Cell Repositories. Chromosomal assignment was independ- 
ently confirmed by additional data for cases pCH1 1 1 1 , pCH1 72, pCH1 74, pCH1 93. and pCH1 1 7, as further outlined in 
Table 5. Taking into account the limitations of conventional cytogenetic analysis, especially in cases with complex kary- 
otypes, the chromosome assignments of the ectopic sequences are in good agreement with the previous cytogenetic 
description of the translocations. 

Somewhat unexpected were the data obtained with Ad-312/SV40. as available molecular cytogenetic analysis had 
indicated its chromosome 12 breakpoint to map far outside the HMGI-C gene; over 1 Mb [42]. The ectopic sequences 
appeared to originate from chromosome 1 (more precisely from a segment within M.l.T. YAC contig WC-511. which is 
partially mapping at 1p22), the established translocation partner (Fig. 2). Further molecular analysis is required to pre- 
cisely define the effect on functional expression of the aberrant HMGI-C gene in this particular case. Of further interest 
to note here, is that the sequences coming from chromosome 1 apparently remove the GGGGT repeat observed in the 
3'-UTR region of HMGI-C. as this repeat is not present in the RACE product In contrast, in primary uterine leiomyoma 
LM-#58 (t(8;12)(q24;ql4«j15)), which was shown to have its breakpoint also in the 3'-UTR, this repeat appeared to be 
present in the RACE product Therefore, removal of this repeat is most probably not critical for tumor development. The 
results with primary tumor LM-#168.1, in which the X chromosome is the cytogenetically assigned translocation part- 
ner, revealed that the ectopic sequences were derived from chromosome 14; the preferential translocation partner in 
leiomyoma. It is possible that involvement of chromosome 14 cannot be detected by standard karyotyping in this par- 
ticular case, as turned out to be the case for U-501/SV40. In primary lipoma U-#294 (t(8;12)(q22;q14)), two alternative 
ectopic sequences were detected. Additional CASH analysis using a hybrid cell mapping panel for regional localization 
of probes t human chromosome 8 [68] revealed that these were both derived from chromosome 8q22-qter (Table 5). 
It is very welt possible that these RACE products correspond to alternatively spliced transcripts. Finally, in four of the 
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cases (Table 5, cases pCHl 14, pCH1 10. PCH109. pCHl 16), the RACE products appeared to correspond to cryptic or 
aberrantly spliced HMGI-C transcripts, as the corresponding ectopic sequences were found to be derived from either 
HMGI-C intron 3 or 4. Such RACE products have also been observed in th control experiments described above. In 
conclusion, the detection of aberrant HMGI-C transcripts in the tumor cells provides additional strong support of HMGI- 
5 C being consistently rearranged by the various chromosome 1 2 aberrations. It should be noted that the aberrant HMGI- 
C transcripts in the various cases should be characterized in full length before arty final conclusion can be drawn about 
biological implications. 

A first and preliminary evaluation of isolated ectopic sequences revealed in phase open reading frames of variable 
length, in the case of primary tumor LM-#25, for instance, already the second codon in the ectopic sequences appeared 

w to be a stop codon (Table 5). A note of caution is appropriate here, as sequence data have been obtained only for 
clones that were produced via two rounds of extensive (probably mutations inducing) PCR. For U-501/SV40, it is of 
interest to note that, in Northern Wot analysis, the isolated ectopic sequences detected a transcript of over 10 kb in a 
variety of tissues, including heart, kidney, liver, lung, pancreas, placenta, and skeletal muscle, but not in brain (data not 
shown). As chromosome 3 is the preferred partner in the chromosome I2q13-ql5 translocations in lipomas and the 

is chromosome 3 breakpoints of various lipomas were found to be spanned by YAC clone CEPH192B10, the detected 
transcript might correspond to a putative lipoma-preferred partner gene (LPP). 

4. Discussion 

20 In ANNEX 1 it was demonstrated that the chromosome 12ql3-q15 breakpoints of lipoma, pleomorphic salivary 
gland adenoma, and uterine leiomyoma, irrespective of their cytogenetic assignments in the past to segment q13, q14, 
or q15 of chromosome 12, all duster within the 1.7 Mb DNA interval designated MAR. In support of the claimed clus- 
tering of breakpoints is a recent study by Schoenberg Fejzo et at. [14], identifying a CEPH mega-YAC spanning the 
chromosome 12 translocation breakpoints in two of the three tumor types. In the present study, we have conclusively 

25 demonstrated by FISH analysis that chromosome 12 breakpoints of seven different solid tumor types are clustering 
within a relatively small (175 kb) segment of MAR. For some tumor cell lines, Southern blot data were obtained and 
these were always in support of the FISH results. From all these observations, we conclude that this segment of MAR 
constitutes a major target area for the chromosome 12 aberrations in these tumors and that it is likely to represent the 
postulated MAG locus: the multi-tumor aberrant growth locus that might be considered as common denominator in 

30 these tumors. 

Within the 175 kb MAR segment, we have identified the HMGI-C gene and determined characteristics of its 
genomic organization. Structurally, the HMGI-C-encoded phosphoprotein consists of three putative DNA binding 
domains, a spacer region, and an acidic carboxy-terminal domain, and contains potential sites of phosphorylation for 
both casein kinase II and p34/cdc2 [65, 67]. We have provided strong evidence that HMGI-C is a prime candidate target 

35 gene involved in the various tumor types studied here. In FISH studies, the breakpoints of 29 out of 33 primary tumors 
were found to be mapping between two highly informative cosmids 142H1 and 27E12; the first one containing the three 
DBD-encoding exons and the second one, the remaining exons that code for the two other domains. Therefore, the 
majority of the breakpoints map within the gene, most of them probably within the 1 40 to intron (intron 3), which is also 
in line with FISH results obtained with the 26 tumor cell lines that were evaluated. It should also be noted that the 5'- 

40 end of the HMGI-C gene is not yet fully characterized. As HMGI(Y), another member of this gene family, is known to 
possess various alternative first exons [69], the size of the HMGI-C gene might be larger than yet assumed. Further 
support that HMGI-C is affected by the chromosome 12 aberrations can be deduced from the results of the 3'-RACE 
experiments. Aberrant HMGI-C transcripts were detected in tumor cells, consisting of transcribed HMGI-C sequences 
fused to newly acquired sequences, in most cases clearly originating from the chromosomes that were cytogeneticaliy 

45 identified as the translocation partners. It is noteworthy that many chromosomes have been found as translocation part- 
ner in the tumors studied. This observed heterogeneity in the reciprocal breakpoint regions involved in these transloca- 
tions resembles that of a variety of hematological malignancies with chromosome 1 1q23 rearrangements involving the 
MLL gene [70], the translational product of which carries an amino-terminai motif related to the DNA-binding motifs of 
HMGI proteins. 

so An intriguing issue pertains to the effect of the chromosome 1 2 aberrations on expression of the HMGI-C gene and 
the direct physiological implications. Some functional characteristics of HMGI-C are known or may be deduced specu- 
latively from studies of other family members; As it binds in the minor groove of DNA, it has been suggested that HMGI- 
C may play a role in organising satellite chromatin or act as a transcription factor [71 . 72]. Studies on HMGIfY), which 
is the member most closely related to HMGI-C, have suggested that HMGI(Y) may function as a promoter-specific 

55 accessory factor for NF-k B transcriptional activity [73]. HMGIfY) has also been shown to stimulate or inhibit DNA bind- 
ing of distinct transcriptional factor ATF-2 isoforms [74]. Both studies indicate that the protein may simply constitute a 
structural component of the transcriptional apparatus functioning in promoter/enhancer contexts. In a recent report on 
high mobility group protein 1 (HMG1), yet another member of the HMG gene family with a similar domain structur as 
HMGI-C and acting as a quasi-transcription factor in gen transcription, a truncated HMG1 prot in lacking the acidic 
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carboxy-terminal region was shown to inhibit gene transcription [751. K was put forward that the acidic terminus of the 
HMG1 molecule is essential for the enhancement of gene expression in addition to elimination of the repression caused 
by the DNA binding. As most of the chromosome 12 breakpoints seem to occur in the 140 kb intron, separation of the 
DBDs from the acidic carboxy-terminal domain seems to occur frequently. In case the acidic domain in HMGI-C has a 

5 similar function as the one in HGMI(Y), the result of the chromosome 12 aberrations is likely to affect gene expression. 
Finally, it should be noted that the fate of th sequences encoding the acidic carboxy-terminal region is not yet known. 

As HMGI-C is the first member of the HMG gene family that might be implicated in the development of benign 
tumors, the question arises as to whether other members of this family could also be involved. The HMG protein family 
consists of three subfamilies: i) the HMG1 and 2 type proteins, which have been found to enhance transcription in vitro 

10 and may well be members of a much larger class of regulators with HMG boxes; ii) the random-coil proteins HMG14 
and 17 with an as yet unknown function; iii) the HMGI-type proteins, which bind to the minor groove and include HMGI- 
C, HMGI, and HMGI-Y; the latter two are encoded by the same gene. It is of interest to note that published mapping 
positions of members of the HMG family coincide with published chromosome breakpoints of benign solid tumors such 
as those studied here. The HMGI(Y) gene, for instance, has been mapped to human chromosome 6p21 [69], which is 

is known to be involved in recurrent translocations observed in uterine leiomyoma, lipoma, and pleomorphic salivary gland 
adenoma [76]. As listed in the Human Genome Data Base, not all known members of the HMG family have been chro- 
mosomally assigned yet, although for some of them a relatively precise mapping position has been established. For 
instance, HMG17 to chromosome 1p36.1-p35, HMG1 L to 13q12, and HMG14 to 21q22.3; all chromosome segments 
in which chromosome breakpoints of the tumor types studied here have been reported [76]. Whether HMGI(Y) or any 

20 other of these HMG members are indeed affected in other subgroups of these tumors remains to be established. Of 
interest to mention, furthermore, are syndromes such as Bannayan-Zonana (McKusick #153480), Proteus (McKusick 
#1 76920), and Cowden (McKusick #1 58350); the latter syndrome is also called multiple hamartoma syndrome. In 60% 
of the individuals with congenital Bannayan-Zonana syndrome, a familial macrocephaly with mesodermal hamartomas, 
discrete lipomas and hemangiomas were found [70]. 

25 Finally, one aspect of our results should not escape attention. All the tumors that were evaluated in this study were 
of mesenchymal origin or contained mesenchymal components. It would be of great interest to find out whether the 
observed involvement of HMGI-C is mesenchyme-spedfic or may be found also in tumors of non-mesenchymal origin. 
The various DNA clones we describe here are valuable resources to address this important issue and should facilitate 
studies to conclusively implicate the HMGI-C gene in tumorigenests. 

30 

EXAMPLE 3 

Hybrid HMGI-C in lipoma cells. 

35 cDNA clones of the chromosome 3-derived lipoma-preferred partner gene LPP (>50 kb) were isolated and the 
nucleotide sequence thereof established. Data of a composite cDNA are shown in Fig. 4. An open reading frame for a 
protein (612 amino acids (aa)) with amino acid sequence similarity (over 50%) to zyxin of chicken was identified. Zyxin 
is a member of the LIM protein family, whose members all possess so-called LIM domains [78]. LIM domains are 
cysteine-rich, zinc-binding protein sequences that are found in a growing number of proteins with divers functions, 

40 including transcription regulators, proto-oncogene products, and adhesion plaque constituents. Many of the UM family 
members have been postulated to play a role in cell signalling and control of cell fate during development Recently, it 
was demonstrated that LIM domains are modular protein-binding interfaces [79]. Like zyxin, which is present at sites of 
cell adhesion to the extracellular matrix and to other cells, the deduced LPP-encoded protein (Fig. 6) possesses three 
LIM domains and lacks classical DNA-binding homeodomains. 

45 In 3'-RACE analysis of U-501/SV40, a HMGI-C containing fusion transcript was identified from which a hybrid pro- 
tein (324 aa) could be predicted and which was subsequently predicted to consist of the three DBDs (83 aa) of HMGI- 
C and, carboxy-terminally of these, the three LIM domains (241 aa) encoded by LPP. In PCR analysis using approriate 
nested amplimer sets similar HMGI-C/LPP hybrid transcripts were detected in various primary lipomas and lipoma cell 
lines carrying a t(3;12) and also in a cytogenetically normal lipoma. These data reveal that the cytogeneticaliy detecta- 

so ble and also the hidden t(3;12) translocations in lipomas seem to result consistently in the in-phase fusion of the DNA- 
binding molecules of HMGI-C to the presumptive modular protein-binding interfaces of the LPP-encoded protein, 
thereby replacing the acidic domain of HMGI-C by LIM domains. Consequently, these protein-binding interfaces are 
most likely presented in the nuclear environment of these lipoma cells, where they might affect gene expression, pos- 
sibly leading to aberrant growth control. Out of the large variety of benign mesenchymal tumors with chromosome 

55 1 2q 1 3-q 1 5 aberrations, this is the first example of a chromosome translocation partner contributing recurrently and con- 
sistently to the formation of a well-defined tumor-associated HMGI-C fusion protein. 
Figure 5 shows the cDNA sequence of the complete isolated LPP gene. 
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EXAMPLE 4 

Diagnostic test for lipoma 

A biopsy of a patient having a lipoma was taken. From the material thus obtained total RNA was extracted using 
the standard TRIZOL™ LS protocol from GIBCO/BRL as described in the manual of the manufacturer. This total RNA 
was used to prepare the first strand of cDNA using reverse transcriptase (GIBCO/BRL) and an oligo dT(1 7) primer con- 
taining an attached short additional nucleotide stretch. The sequence of the primer used is as described in Example 2, 
under point 2.5.. RNase H was subsequently used to remove the RNA from the synthesized DN A/RNA hybrid molecule. 
PCR was performed using a gene specific primer (Example 2, point 2.5.) and a primer complementary to the attached 
short additional nucleotide stretch. The thus obtained PCR product was analysed by get electrofbresis. Fusion con- 
structs were detected by comparing them with the background bands of normal cells of the same individual. 

In an additional experiment a second round of hemi-nested PCR was performed using one internal primer and the 
primer complementary to the short nucleotide stretch. The sensitivity of the test was thus significantly improved. 
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Figure 8 shows a typical gel. 

TABLE I 



ANALYSIS OF YAC CLONES 



10 



15 



25 



30 



35 



50 



CcPH-Code 


Size (kb) 


Landmark left 




Landmark right # 


Chimeric 




71 K 
1 10 






[RM10] 




YES (L + R) 


7HP1 
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ND 
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1 OQU 1 £. 
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U«9UOw 
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234(31 1 
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U29046 






WD 
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o/ ana 


90fl 
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£&CC1U 


01U 


romtici 
IfiMisj 




RM16 


U29048 


YES (g 


10 llsO 


ATA 






RM26 


U29045 
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lUr U t 
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rVU 


AQQC5 


320 


RM44 




RM46 


U29037 


Kin 




9R5 










Kin 

INU 








1 (90/Y41 






Kin 


lOO ww 


O IU 


rRmuoi 




RM65 


1 1 AAA A A 

U29042 


TfcO {LJ 






DUen 
niviou 


U£9uOv 


KMOO 


U29040 


Kin 

INU 


106E8 


id/1 


RMC7 


1 190/133 
U&SUOO 


RM63 


U 29038 


Kin 

INU 


55G1 


365 


RM56 




RM62 


U 29039 


KID 




17A 


DUQC 

rtjvioo 


1 I90/19A 


RM80 


U29036 


Kin 

INU 




9oa 






RM81 


U29Q26 


Kin 

INU 




2UU 


QU70 


1 190/VaA 


RM82 


U29029 


Kin 

INU 




low 


rDHJ70l 




RM83 


U29027 






99* 


IRMfiTl 

jnjvio/ j 




RM84 


U29032 


yp^ n i 


138F3 


AfiA 




U&9U£0 


RM91 


U29019 


INU 


226E7 


CArt 


nr»i*to 


1 190/19 A 


DMCA 


umuio 


NO 


499E9 
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BM51 


t loon ifi 






YES (R) 


312F10 


560 


[RM50] 




rlNltw 


I I9QA91 


YES (U 


82SG7 


950 








ND 


34B5 
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RM88 


U29020 


PMAQ 




ND 


94A7 
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YES (R) 


305B2 
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YES flJ 


379H1 
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RM104 


U29014 


RM105 


U29009 


ND 


444E6 


350 


RM92 


U29017 


RM93 


U29010 


ND 


446H3 


370 


RM94 


U29011 


RM95 


U29018 


ND 


403B12 


380 










ND 


261 E5 


500 


RM102 


U29012 


RM103 


U26689 


ND 


78B11 


425 










ND 


921B9 


1670 










ND 


939H2 


1750 










ND 


188H7 


360 










ND 


142F4 


390 










ND 


404E12 


360 










ND 


164A3 


375 










ND 


244B12 


415 


RM106 


U29007 


RM107 


U29008 


ND 


275H4 


345 


RM108 


U29004 


RM109 


U29005 


ND 


320F9 


370 










ND 


51 F8 


450 










ND 


242A2 


160 


CHI 


U29006 






ND 



55 
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TABLE I ( ontinued) 
ANALYSIS OF YAC CLONES 



is 



20 



25 



30 



253H1 


400 


303F11 


320 


322C8 


410 


208G12 


370 


341 C1 


270 


3S4F1 


270 


452E1 


270 


41A2 


310 


934D2 


1370 


944E8 


1290 


2G11 


350 


7CC07 


1390 


365A12 


370 


803C2 


1080 


210C1 


395 


433C8 


360 


I 402A7 


500 


J 227E8 


465 


j 329F9 


275 


261 E6 


395 


| 348F2 


370 


I 6F3 


320 


j 59F12 


430 


J 265H3 


300 









U29003 




U^90U2 


RM97 


U27135 


RM98 




HM99 


U27130 


CHS 


U271 36 










CHS 




-RM70 


U26998 


RM86 


U27133 


RM73 


U29000 


RM76 


U27132 


RM41 


-U28994 


[RM42] 




RM53 


U27134 


RM55 


U28996 


RM72 


U28793 


RM75 


U28997 


[RM71J 




RM74 


U28995 






[RM136] 




RM35 


U27140 


RM36 


U27141 


RM34 


U28794 


RM33 


U27131 






RM40 


U28999 



ND 
ND 
ND 
ND 
ND 
ND 
ND 
ND 
ND 
ND 
.ND 

YES (4 

ND 

ND 

ND 

NO 

YES (R) 

ND 

ND 

YES(L) 
YES (R) 




YAC clones were isolated from CEPH YAC libraries as described In Materials and Methods. ND: not 
detected by methods used. Landmarks not mapping within the 6 Mb contig have been bracketed 
GenBank accession numbers are given (#). 
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55 
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TABLE II 
PCR Primers 



5 



• 


STS name 


Nucleotide 


Product size 






(STS 12-) 


sequence 5'-3* 


(bp) 




10 


CHI 


TGGGACTAACGGATTTTCAA 
TGTGGTTCATTCATGCATTA 


213 


58 




CH2 


TCCATCATCATCTCAAAACA 
CTCTACCAAATGGAATAAACAG 


145 


58 


15 


CHS 


GCAGCTCAGGCTCCTTCCCA 


143 


58 




TGGCTTCCTGAAACGCGAGA 








CH8 


TCTCCACTGCTTCCATTCAC 
ACACAAAACCACTGGGGTCT 


147 


58 




CH9 


CAGCTTTGGAATCAGTGAGG 
CCTGGGGAAGAGGAGTAAAG 


262 


. 58 


20 


RM1 


GAGCTTCCTATCTCATCC 
ATGCTTGTGTGTGAGTGG 


308 


60 




RM4 


TTTGCTAAGCTAGGTGCC 
AGCTTCAAGACCCATGAG 


236 


60 




RMS 


CAGTTCTGAGACTGCTTG 


324 


60 


25 




TAATAGCAGGGACTCAGC 








RM7 


CTTGTCTCATTCTTTTAAAGGG 
CACCCC1TTTTAGATCCTAC 


S38 


58 




RM1 3 


GAATGTTCATCACAGTGCTG 
AATGTGAGGTTCTGCTGAAG 


+500 


58 


30 


RM14 


TTCTCATGGGGTAAGGACAG 


158 


58 




AAAGCTGCTTATATAGGGAATC 








RM16 


CCTTGGCTTAGATATGATACAC 
GCTCTTCAGAAATATCCTATGG 


252 


58 




RM21 


CCTTAGCAGTTGCTTGTCTG 
TCGTCACAGGACATAGTCAC 


290 


58 


35 


RM26 


TCTATGGTATGTTATACAAGATG 
CAGTGAGATCCTGTCTCTA 


102 


58 




BM31_ 


TCTGTGATGTTTTAAGCCACTTAG 
AATTCTGTGTCCCTGCCACC . 




DO 




RMjo 


ATTCTTCCTCAwCTCUUACU 




ou 


40 




AATCTGCAGAGAGGTCCAGC 








RM34 


AATTCTCCATCTGGGCCTGG 
GAACGCTAAGCATGTGGGAG 


±600 


60 




RM36 


CTCCAACCATGGTCCAAAAC 
GACCTCCAGTGGCTCTTTAG 


296 


60 


45 


RM46 


ACCATCAGATCTGGCACTGA 


241 


57 




TTACATTGGAGCTGTCATGC 








RM48 


TCCAGGACATCCTGAAAATG 
AGTATCCTGCACTTCTGCAG 


391 


58 




RM51 


GATGAACTCTGAGGTGCCTTC 


311 


60 


50 




TCAAACCCAGCTTTGACTCC 






RM53 


GTCTTCAAAACGCTTTCCTG 


333 


60 



TGGTTTGCATAATGGTGATG 
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TABLE II (continued) 
5 PCR Primers 



STS name Nucleotide Product size 

(STS 12-) sequence 5*-3' (bp) 

10 ~~ ~" ■— 



RM60 


TACACTACTCTGCAGCACAC 


94 


58 




TCTGAGTCAATCACATGTCC 




RM69 


CTCCCCAGATGATCTCTTTC 


236 


58 




CGGTAGGAAATAAAGGAGAG 




RM72. 


TATTTACTAGCTGGCCTTGG 


101 


62 




CATCTCAGGCACACACAATG 




RM76 


ATTCAGAGAAGTGGCCAAGT 


496 


58 




GGGATAGGTCTTCTGCAATC 




RM85 


TCCAACAATACTGAGTGACC 


435 


58 




TCCATTTCACTGTAGCACTG 




RM86 


GTAATCAACCATTCCCCTGA 


203 


56 




AAAATAGCTGGTATGGTGGC 




BM90 


ACTGCTCTAGTTTTCPAAGGA 


257 


58 




AATTTACCTGACAGTTTCCT 




RM93 


GCATTTGACGTCCAATATTG 


347 


60 




ATTCCATTGGCTAACACAAG 




RM98 


GCAAAACTTTGACTGAAACG 
CACAGAGTATCGCACTGCAT 


356 


58 


RM99 


AAGAGATTTCCCATGTTGTG 


240 


58 




CTAGTGCCTTCACAAGAACC 




RM103 


AATTCTTGAGGGGTTCACTG 
TCCACACTGAGAGCTTTTCA 


199 


60 


RM108 


GTGGTTCTGTACAGCAGTGG 
TGAGAAAATGTCTGCCAAAT 


439 


60 


RM110 


GCTCTACCAGGCATACAGTG 
ATTCCTAGCATCTTTTCACG 


328 


58 


RM111 


ATATGCATTAGGCTCAACCC 
ATCCCACAGGTCAACATGAC 


312 


58 


RM130 


ATCCTTACATTTCCAGTGGCATTCA 


336 


58 




CCCAGAAGACCCACATTCCTCAT 






RM131 


TTTTAAGTTTCTCCAGGGAGGAGAC 
AATAGGCTCTTTGGAAAGCTGGAGT 


226 


58 


RM132 


TCTCAGCTTAATCCAAGAAGGACTTC • 
GGCATATTCCTCAACAATTTATGCTT 


376 


58 


RM133 


TGGAGAAGCTATGGTGCTTCCTATG 
TGACAAATAGGTGAGGGAAAGTTGTTAT 


225 


58 


EST01096 


TCACACGCTGAATCAATCTT 
CAGCAGCTGATACAAGCTTT 


188 


58 


IFNG 


TGTTTTCTTTCCCGATAGGT 
CTGGGATGCTCTTCGACCTC 


150 


52 


RaplB 


CCATCCAACATCTTAAATGGAC 
CAGCTGCAAACTCTAGGACTATT 


149 


58 



50 

STSs were isolated as described in Materials and Methcds, or 
retrieved from literature for EST01096, IFNG, and RaplB. 
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TABLE 4 



FISH mapping of chromosome 12 breakpoints in primary benign solid tumors to a subregion of MAR 


Tumor type 


Breakpoint within MAR 


Fraction of tumors with breakpoints 
within main breakpoint cluster region* 


Lipoma 


6/6 


6/6 


Pleomorphic salivary gland adenoma 


7/7 


5/7 


Uterine leiomyoma 


7/8 


7/8 


Hamartoma of the breast 


1/1 


1/1 


Fibroadenoma of the breast 


1/1 


1/1 


Hamartoma of the lung 


8/9 


8/9 


Angiomyxoma 


1/1 


1/1 



* Tumor samples were collected and analyzed at the histopaihobgy and cytogenetics facilities of the University of 
Bremen. A mixture of cosmid clones 27E12 and 142H1 was used as molecular probe in FISH analysis. 
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LEGENDS TO THE FIGURES 
Figure 1 

Long range physical map of a 6 Mb region on the long arm of human chromosome 1 2 deduced from a YAC contig 
consisting of 75 overlapping CEPH YAC clones and spanning the chromosome 12q breakpoints as present in a variety 
of benign solid tumors. The long range physical map of the composite genomic DNA covered by the YAC inserts is rep- 
resented by a black solid line with the relative positions of the various restriction sites of rare cutting enzymes indicated. 
DNA regions in which additional cutting sites of a particular restriction enzyme might be found are indicated by arrows. 
Polymorphic restriction endonuclease sites are marked with asterisks. DNA markers isolated and defined by others are 
depicted in green. DNA markers obtained by us are shown in boxes and are labelled by an acronym (see also Table I 
and II). The relative positions of these DNA markers in the long range physical map are indicated and those correspond- 
ing to particular YAC ends are linked to these by a dotted line. Some of the DNA markers have been assigned to a DNA 
interval and this is indicated by arrows. For DNA markers in white boxes STSs have been developed and primer sets 
are given in Table II. For those in yellow boxes, no primer sets were developed. The DNA intervals containing RAP1B, 
EST01 096, or IFNG are indicated. Where applicable, D number assignments are incficated. Below the long range phys- 
ical map, the sizes and relative positions of the overlapping YAC clones fitting within the consensus long range restric- 
tion map are given as solid blue lines. DNA regions of YAC inserts not fitting within the consensus long range restriction 
map are represented by dotted blue lines. CEPH microtiter plate addresses of the YAC clones are listed. The orientation 
of the YAC contig on chromosome 12 is given. The relative positions of ULCR12 and MAR are indicated by red solid 
lines labelled by the corresponding acronyms. Accession numbers of STSs not listed in Table I: CH9 (#U27142); RM1 
(#U29049); RM1 10 (#1129022); RM1 1 1 (#1)29023); RM130 (#U27139); RM131 (#1129001); RM132 (#U27138); RM133 
(#1127137). Restriction sites: B: BssHII; K: Kspl (=Sacll); M: Mlul; N: Not); P: Pvul; Sf: Sfil. 

Figure 2 

Contig of overlapping cosmids, long range restriction and STS map spanning a segment of MAR of about 445 kb. 
Contig elements are numbered and defined in the list below. LL1 2 NC01 -derived cosmid clones are named after their 
microtiter plate addresses. GenBank accession numbers (#) of the various STSs are listed below. STSs are given in 
abbreviated form; e.g. RM33 instead of STS 12-RM33. A 40 kb gap between STSs "K" and "O" in the cosmid contig 
was covered by ■ clones (clones 38 and 40) and PCR products (clones 37 and 39). The orientation of the contig on the 
long arm of chromosome 12 is given as well as the order of 37 STSs (indicated in boxes or labelled with encircled cap- 
ital letters). The slanted lines and arrows around some of the STS symbols at the top of the figure mark the region to 
which the particular STS has been assigned. It should be noted that the cosmid contig is not scaled; black squares indi- 
cate STSs of cosmid ends whereas the presence of STSs corresponding to internal cosmid sequences are represented 
by dots. Long range restriction map: Bs: BssHII; K: Kspl (=Sacll); M: Mlul; N: Notl; P: Pvul; Sf : Sfil. At the bottom of the 
figure, detailed restriction maps are shown of those regions containing exons (boxes below) of the HMGI-C gene. Non- 
coding sequences are represented by open boxes and coding sequences by black boxes. Estimated sizes (kb) of 
introns are as indicated. The relative positions of the translation initiation (ATG) and stop (TAG) codons in the HMGI-C 
gene as well as the putative poly-adenylation signal are indicated by arrows. Detailed restriction map: B: BamHI; E: 
EcoRI; H: Hindi)). MAR: Multiple Aberration Region; DBD: DNA Binding Domain. 



1=140A3 11=142G8 21=124D8 31=59A1 41=128A2 51=65E6 
2=202A1 12=154A10 22=128A7 32=101D8 42=142H1 52=196E1 
3=73F11 13=163D1 23=*129F9 33=175C7 43=204A10 53=215A8 
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4=80C9 14=42H7 24=181C1 34=185H2 44=145E1 54=147G8 

5=109B12 15=113A5 25=238E1 35=189C2 45=245E8 55=*211A9 

6=148C12 16-191H5 26=69B1 36-154B12 46=*154F9 56-22D8 

7=14H6 17=248E4 27»260C7 37=pRM150 47=62D8 57-116B7 

8=51F8 18=33H7 28=156A4 38=pRM144 48=104A4 58-144D12 

9=57C3 19«50D7 29=27E12 39=PKXL 49=184A9 

10=86A10 20=68B12 30=46G3 40=pRM147 50=56C2 



A 




STS 


12 


-EM12 (#U27145) 


I 




STS 


12 


-CH12 


(#U27153) 


Q 




STS 


12- 


-RM120 (#U27161) 


B 




STS 


12 


-EM30 


(#U27146) 


J 




STS 


12- 


-EM10 (#U27154) 


R 




STS 


12 


-RM118 (#U27162) 


C 




STS 


12 


-EM14 (#U27147) 


K 




STS 


12 


-EM37 


(#U27155) 


S 




STS 


12 


-RM119 (#U27163) 


D 




STS 


12 


-EM31 


(#U27148) 


L 




STS 


12 


-RM146 (#U27156) 


T 




STS 


12 


-EM2 


(#U27164) 


E 




STS 


12 


-CH11 (#U27149) 


M 




STS 


12 


-RM145 (#U27157) 


U 




STS 


12 


-EM4 (#U27165) 


F 


a 


STS 


12 


-EM18 


(#U27150) 


N 




STS 


12 


-RM151 (/U27158} 


V 


as 


STS 


12 


-EM3 


(#U27166) 


G 




STS 


12 


-EMU (#U27151) 


0 




STS 


12 


-EM16 


(#U27159) 


W 




STS 


12 


-EM15 (#U27167) 


H 




STS 


12 


-CHIO 


(#U27152) 


P 




STS 


12 


-EMI (#U27160) 


X 


=2 


STS 


12 


-EM17 


(#U27168) 



STS 12-CH5 (#U27136) STS 

STS 12-RM33 (#U27131) STS 

STS 12-RM76 (#U27132) STS 

STS 12-RM98 (#U26647) STS 

STS 12-RM103 (#U26689) STS 

STS 12-RM132 (#U27138) STS 
STS 12-RM151 (#U27158) 



12-CH9 (#U27142) 
12HRM53 (#U27134) 
12-RM86 (#U27133) 
12-RM99 (#U27130) 
12-RM130 (#U27139) 
12-RM133 (#U27137) 



Rgure 3 

Schematic representation of FISH mapping data obtained for tumor cell lines with chromosome 12q13-q15 aberra- 
tions, including 8 lipoma, 10 uterine leiomyoma, and 8 pleomorphic salivary gland adenoma cell lines in consecutive 
experiments following our earlier FISH studies. Probes used included phage clones pRM144 (corresponding STSs: 
RM86 and RM130) and pRM147 (RM151), and cosmid clones 7D3 or 152F2 (RM103), 154F9 (CH9), 27E12 (EM1 1), 
211A9 (RM33), 245E8 (RM53). 185H2 (RM76), 202A1 (RM98). 142H1 (RM99), 154B12 (RM132), and 124D8 
(RM133). The DNA interval between RM33 and RM98 is estimated to be about 445 kb. Dots indicate conclusive FISH 
experiments that were performed on metaphase chromosomes of a particular cell line using as molecular probe, a 
clone containing the STS given in the box above. Solid lines indicate DNA intervals to which a breakpoint of a particular 
cell line was concluded to be mapping. Open triangles indicate deletions observed during FISH analysis. Open circles 
indicate results of FISH experiments on metaphase chromosomes of U-501/SV40 cells with hybridization signals on a 
cytogenetically normal chromosome 3. The positions of chromosome 12 breakpoints of tumor cell lines mapping out- 
side MAR are indicated by arrows. The molecularly cloned breakpoints of LM-30.1/SV40 and LM-608/SV40 are indi- 
cated by asterisks. Breakpoints in various uterine leiomyoma cell lines splitting cosmid 27E12 (EMt 1) are indicated by 
"across". 

Rgure 4 

3'- RACE product comprising the junction between part of the HMGI-C gene and part of the LPP gene. The primers 
used and the junction are indicated. The cDNA synthesis was internally primed and not on the true poly(A) tail. 

Figure 5 

Partial cDNA sequence of the LPP gene. 
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Figure 6 

Amino acid sequence of the LPP gene. LIM domains are boxed. The breaking point is indicated with an arrow. 
Figure 7 

Nucleotide sequence if HMGI-C (U28749). The transcription start site indicated as proposed by Manfioletti et al. 
[67] was arbitrarily chosen as a start sita The sequence contains the complete coding sequence. 

Figure 8 

Gel of PCR products obtained as described in Example 4. 
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ANNEX 1 

GENES. CHROMOSOMES d CANCER I (1993) 



Molecular Characterization of MAR, a Multiple Aberration 
Region on Human Chromosome Segment 1 2q 1 3-q 1 5 
Implicated in Various Solid Tumors 

WIm J.M. Van do Von, Erie F.PJi. Schoenmakers, Sylk* Wanschura, Bornd Kaxmiorcxak, Patrick F.j. Koob, 
Jan M.W. G«urts, Sabine Bartnitxke, Herman Van den Bergh*, and Jorn BuJkrdlek 

Center for Hunan Genetics, University of Leuven. Leuven. Bdgwn (WJ Jh.VjLV« EJWtS, fcfJJC. JJtW.G, HLVdajc 
Center for Human Genetics. University of Bremen. Bremen. Germany (S.W, BJC SJL J.B.) 

Chromosome arm 1 2q breakpoint in seven cell lines derived from primary pleomorphic saBvary (hnd adenomas were mapped 
by FISH analysis relative to nine ONA probes. These probes all reside in a 1A Mb genomic DNA region of chromosome 
segment 1 2q 1 3-q 1 5 arid correspond to previously published sequence-tagged sites (ST5). Their retac^ poackvu were esob- 
lished on the basis of YAC darting and long range physical and STS content mapping. The I2q breakpoints of five of the cett 
lines were found to be mapping within three different subregfpns of the 445 leb ONA interval chat was recently defined as the 
uterine leiomyoma cluster region of chromosome 12 breakpoints (U ICR 1 2) between STS RM33 and W198. AD seven 
breakpoints appeared to map within the 1.7 Mb DNA region between STS RM36 and RMI01 Furthermore, the chromosome 
1 2 breakpoints of three primary pleomorphic salivary gland adenomas were also found to be mapping between RM36 and 
RMI01 Finally, FISH analysis of two lipoma cell lines with 1 2q 1 3-q 1 5 aberrations pinpointed the breakpoints of these to 
relatively small and adjacent DNA segments which, as weU as those of two primary lipomas, appeared to be located also 
between RM36 and RM I OX We conclude from the observed clustering of the 1 2q breakpoints of the three distinct sofid tumor 
types that the 1.7 Mb DNA region of the long arm of chromosome 12 between RM36 and RM 103 is a multiple aberration 
region which we designate MAR. Cents Chromosom Cancer 11296-202 (1995). © 1995 Wley-Uss. inc. 



INTRODUCTION 

Chromosome translocations involving region 
ql3-ql5 of chromosome 12 have been observed in 
a wide variety of solid tumors (Mitelman, 1991). In 
subgroups of cycogenetically abnormal uterine lei- 
omyomas (NUbert and Heim, 1990; Pandis ec ah, 
1991), pleomorphic salivary gland adenomas (San- 
dros ec al., 1990; Bulierdiek et al M 1993), and be- 
nign adipose tissue tumors (Sreekancaiah et al., 
1991), 12ql3-ql5 aberrations are frequently ob- 
served. In a recent study (Schoenmakers ec aL, 
1994b), we identified and molecularly character- 
ized ULCR12. the uterine leiomyoma cluster re* 
gion of chromosome 12 breakpoints. In the present 
study, we focus on the chromosome arm 12q break- 
points in pleomorphic adenoma of the salivary 
glands, a benign epithelial rumor originating from 
the major or minor salivary glands. It is the most 
common type of salivary gland tumor and accounts 
for almost 50% of all neoplasms in these organs. 
About 85% of the tumors are found in che parotid 
gland, 10% in che minor salivary glands, and 5% in 
che submandibular gland (Seifert et al., 1986). Al- 
though many of chese adenomas appear to have a 
normal karyotype, cytogenetic studies have also re- 
vealed recurrent specific chromosome anomalies 
(Sandros ec al., 1990; Bulierdiek cc a!., 1993). Be- 

© 1995 Wn«y.Uss, Irtc 



sides chromosome 8 aberrations, often transloca- 
tions with a breakpoint in 8ql2 with, as che most 
common aberration, a t(3;8Kp21;ql2), aberrations 
of chromosome 12, usually translocations involving 
12ql3-ql5, are also frequent. Non-recurrent clonal 
abnormalities have also been described. The fre- 
quent involvement of region 12ql3-ql5 in distinct 
solid tumor types suggests that this chromosomal 
region harbors gene(s) chat might be implicated in 
che evolution of these tumors. Molecular cloning of 
the chromosome 12 breakpoints of chese tumors 
and characterization of che junction fragments may 
therefore lead to che identification of such gene(s). 

On the basts of fluorescence in situ hybridization 
(FISH) data, we previously reported that the chro- 
mosome 12 breakpoints in a number of cell lines 
derived from primary pleomorphic salivary gland 
adenomas (Kazmierczaketal., 1990; Schoenmakers 
ec al., 1994a) are located on the long arm of chro- 
mosome 12 in che interval between loci D12S19 and 
D12S8 (Schoenmakers ec al., 1994a). This DNA 
interval has been estimated to be about 7 cM (Keats 
ec al., 1989; Craig et al., 1993). The interval con- 
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earning che chromosome 12 breakpoints of these 
tumor cells was narrowed further by showing that all 
breakpoints mapped distaily to the CHOP gene, 
which is directly affected by the characteristic t(12; 
16) translocation in myxoid liposarcomas (Aman ec 
ai. t 1992; Crozatetal., 1993; Rabbiccs et al„ 1993) 
and is located between DlZS19and D12S8. In more 
recent studies (Kools ecil., 1995). the chromosome 
12 breakpoint of pleomorphic salivary gland ade- 
noma cell line Ad-312/SV40 was pinpointed to a 
DNTA region between sequence-tagged sites (STSs) 
RMltO and RMlil, which is less than 165 kb in 
size. FISH evaluation of the chromosome 12 break- 
points of the other pleomorphic salivary gland ad- 
enoma cell lines indicated that they muse be located 
proximally to che one in Ad-312/SV40, at a distance 
of more than 800 kb (Kools et ai., f995). These 
results pointed toward a possible dispersion of che 
chromosome 12 breakpoints over a relatively large 
genomic region on the long arm of chromosome 12. 

Here, we report physical mapping of the chro- 
mosome 12 breakpoints in pleomorphic salivary 
gland adenoma cells from primary tumors as well 
as established tumor cell lines. The karyocypic 
anomalies observed in the cells were all different 
but always involved region ql3-ql5 of chromo- 
some 12. Using DNA probes between D12S8 and 
CHOP, which corresponded to sequence-cagged 
sices (STSs) of a long-range physical map of a 6 Mb 
DNA region and were obtained during chromo- 
some walking experiments, we performed FISH 
experiments and defined more precisely a major 
chromosome 12 breakpoint cluster region of pleo- 
morphic salivary gland adenoma. This breakpoint 
cluster region appeared to overlap with ULCR12. 
Furthermore, we tested whether 12ql3-ql5 break- 
points of lipomas might also map within the same 
region as those of pleomorphic salivary gland ade- 
noma and uterine leiomyoma. 

MATERIALS AND METHODS 
Primary Solid Tumors and Derivative Cell Lines 

Primary solid tumors including pleomorphic sal- 
ivary gland adenomas, lipomas, and uterine leio- 
myomas were obtained from the University Clinics 
in Leuven, Belgium (Dr. I. De Wever); in Bre- 
men, Germany (Dr. R. Chiile); in Krefeld, Ger- 
many (Dr. J. Haubrich); and from the Institute of 
Pathology in Gdceborg, Sweden (Dr. G. Sccnman). 
For cell cul curing and subsequent FISH analysis, 
tumor samples were finely minced, treated for 4-6 
hours with 0.8% collagenase (Boch ringer, Mann- 



TABLE I. Chromosome 12 Aberrations in Primary Human 
Solid Tumors and Cell lines* 



Aberration 



Cell lines 




f\Q-A 1 I/3V1U 


t(o; 1 2}(q2 ( ;q 1 3-q 1 5) 


Ad-248/SV40 


ira(ll6)fqtS:ql6o2n 


Ad-263/3V40 




Ad-29S/SV40 


t(8:l2;l8Kpl2Ml43lia) 


Ad-302/5V40 


t(7:l2Xq3l;ql4) 


Ad-3o£/SV40 


inv<12){pl3qlS) . 


A4-386/SV40 


t(l2;14KqlHlSiql3H,IS) 


U-I4/SV40 


c(3;l2)(q2*ql3) 


U-53S7SV40 


C<3;l2)(q27;ql4) 


LM-S.I/SV40 


rfI2;l5Xql53q24) 


LM-3Q.I/SV40 




LM-6S/SV40 


c(l2:MXql*q24) 


LM-67/SV40 


t(12:t4Kql3^l5^q24) 


LM-I0O/SV40 


t<12:l4XqlS*24) 


LM-60S/SV40 


Ins(llllKqK-q2lqcer) 


LM-6O8/3V40 


t(!2:l4Xql5^24) 


LM-609/SV40 


e(!2;l4KqlS^24) 


Primary tumors 




Ad-386 


o:i2:M)(qlS^MJ) 


Ad-396 


<3:I2) 


Ad-400 


c<lil6) 


U-166 


t(12:l2) 


U-167 


t<3;I2)(q2*ql4-qlS) 


LM-I63.I 


«lil4)<ql4*24) 


LM-I6X2 


t(IZ-14Kql4^24) 


IM- 168.3 


t(XI2KqXtqlS) 


LM-192 


tC2AI2)(qlS$2l;qt4) 


LM-I9&4 


e(l2;l4KqI^24) . 



•Ad. pleomorphic aShnry ghnd uta n uii*. Li, lipoma; LM vtaritm Itia- 



heim, FRG), and processed further for FISH anal- 
ysis according to routine procedures. 

Human tumor cell lines used in this study in- 
cluded the previously described pleomorphic sali- 
vary gland adenoma cell lines Ad«21iySV40,Ad.246V 
SV40,Ad-263/SV40, Ad-293/SV40, Ad-302/SV40, 
Ad-3667SV40, and Ad-386/SV40 (Kazmierczak et 
a!., 1990; Schoenmakers et a!., 1994a) and che li- 
poma cell lines LM4/SV40 (Schoenmakers et aL, 
1994a) and the recendy developed Li-538/SV40. 
Chromosome 12 aberrations found in these cell 
lines are listed in Table 1. Cells were propagated in 
TC199 culture medium with Eagle's salts supple- 
mented with 20% fetal bovine serum. 

DNA Probes 

In the context of a human genome project focus- 
ing on the long arm, of chromosome 12, we isolated 
cosmid clones cRM33. cRM36 ? cRMSl, cRM69, 
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cRM72, cRM76, cRM98, cRM103, and cRM133, 
from chromosome 12*specific arrayed cosmid li- 
brary LLNL12NC01 (Moncgomery ec a!., 1993). 
Further details of these cosmid clones have been 
reported ac the Second International Chromosome 
12 Workshop (1994) and will be described else- 
where (Kucherlapati et al., 1994). Briefly, initial 
screenings were performed using a PCR-bascd 
screening strategy (Green and Olson, 1990), fol- 
lowed by filter hybridization analysis as the final 
screening step, as previously described (Schoen- 
makcrs ec al. ( 1994b). The cosmid clones were iso- 
lated using STSs derived from YAC clones. STSs 
were obtained upon rescue of YAC insert-ends us- 
ing a methodology involving vecrorerte-PCR fol- 
lowed by direct solid phase fluorescent sequencing 
of the PCR products (Geurts et ai. ( . 1994) or from 
ineer-Alu PCR (Nelson et al., 1989). Cosmid 
clones were grown and handled according co stan- 
dard procedures (Sambrook ec al. t 1989). 

Cosmid clone cPK12qter, which maps co the celo- 
meric region of the long arm of chromosome 12 
(Kools ecal., 1995), was used as a reference marker. 

Chromosome Preparations and Fluorescence 
In Situ Hybridization 

Metaphase cells of the pleomorphic salivary 
gland adenoma cell lines or normal human lympho- 
cytes were prepared as described before (Schoen- 
makers ec al., 1993). To unambiguously establish 
che identity of chromosomes in the FISH experi- 
ments, FISH analysis was performed after GTG- 
banding of the same metaphase spreads. GTG- 
banding was performed essentially as described by 
Smic ec al. (1990). In situ hybridizations were car- 
ried out according to a protocol described by 
Kievics ec al. (1990) wich some minor modifications 
(Kools et al., 1994; Schoenmakers et al., 1994b). 
Cosmid and YAC DNA was labelled wich btorin- 
11-dUTP (Boehringer Mannheim) or biorin-14- 
dATP (BRL, Gaithersburg) as described before 
(Schoenmakers ec ah; 1994b). Specimens were an- 
alyzed on a Zeiss Axiophoc fluorescence micro- 
scope using a FITC fiicer (Zeiss). Results were 
recorded on Scotch (3M) 640 asa film. 

RESULTS 

FISH Mapping of I2q Breakpoints in Cell Lines of 
Pleomorphic Salivary Gland Adenoma 

In previous scudies (Schoenmakers et al., 1994a), 
we mapped the chromosome 12 breakpoints in a 
number of pleomorphic adenomas of chc salivary 



glands relative to various DNA markers and estab- 
lished thac chese were all located proximal co locus 
D12S8 and distal to che CrY0?gene. This region is 
somewhat smaller chan chc 7 cM region encom- 
passed by linkage loci DI2S8 and D12S19 (Keacs et 
al., 1989). Using YAC cloning, a long range phys- 
ical/STS map has been constructed covering most of 
chat 7 cM region, as recendy reported (Kucherlapati 
et al. t 1994). Furthermore, numerous genomic 
clones (cosmid clones) have been isolated and their 
relative positions within this map established 
(Kucherlapati ec al., 1994). Nine of these cosmids, 
including cRM33, cRM36, cRM51, cRM69 t 
cRM72, cRM76, cRM98, cRM103, and cRM133, 
were used in FISH scudies co escablish chc positions 
of che chromosome 12 breakpoints of che seven cell 
lines derived from pleomorphic adenomas of die 
salivary glands (Table 1). The relative mapping 
order of chese nine cosmid clones, which cover a 
genomic region on che long arm of chromosome 12 
of abouc 2.8 Mb, is indicated in Figure 1 and the 
results of FISH scudies with che various cosmid 
probes are schematically summarized in che same 
figure. As an illustration, FISH results obtained 
with metaphase cells of cell line Ad-295/SV40 using 
cRM76 and cRM103 as probes are shown in Figure 
2. It should be noted chat for the identification of 
chromosomes, pre-FISH GTG-banding was used 
routinely. On che basis of such banding, hybridiza- 
tion signals could be assigned conclusively co chro- 
mosomes of known identity; this was of major 
importance for cases with cross- or background hy- 
bridization signals, as chese were occasionally ob- 
served. When GTG-banding in combination with 
FISH analysis provided inconclusive resulcs, either 
because of weak hybridization signals or rather 
vague banding, FISH experiments were performed 
with cosmid clone cPKUqter (Kools et aL, 1995) as 
a reference probe. 

FISH analysis of metaphase chromosomes of 
each of che seven pleomorphic salivary gland ade- 
noma cell lines wich cosmid cRMlG3 revealed that 
this cosmid mapped distal to the chromosome 12 
breakpoincs of all seven cell lines studied here. 
Metaphase chromosomes of six of che seven celt 
lines were also cesced wich probe cRM69 and, in 
two cases, with cRMol. The results of che latter 
experiments were always consistent wich chose ob- 
tained wich cRMl03. Similar FISH analysis with 
cRM36 as probe indicated that this probe mapped 
proximal co all che breakpoints. These results were 
always consistent with chose obtained for five of 
che seven cell lines in experiment using cRM72. 
Alcogecher, che resulcs of our FISH scudies indi- 
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cated chac the chromosome 12 breakpoints of all 
seven cell lines map between cRM36 and cRMl03, 
which spans a genomic region of about 1.7 Mb. 

Fine Mapping of I2q Breakpoints In Co.ll Lines 
Derived From Pleomorphic Adenomas of the 
Salivary Glands 

For subsequent fine mapping of the chromosome 
12 breakpoints of the seven pleomorphic salivary 
gland adenoma cell lines, additional FISH studies 
were performed, as schematically summarized in 
Figure 1. The breakpoints of cell lines Ad-211/ 
SV40, Ad-Z95/SV40, and Ad-366/SV40 appeared to 
be located in the DNA region between cRM76 and 
cRMl33, which was estimated to be about 75 kb. 
The breakpoints of the four other cell lines were 
found in different areas of the 1.7 Mb region be- 
tween cRM36 and cRMl03. That of cell line Ad- 
2467SV40 in a DNA segment of about 270 kb be- 
tween cRM33 and cRM76, that of Ad-263/SV40 in 
a DMA segment of about 1 Mb between cRM98 and 
cRM 103, that of Ad-302/SV40 in a DNA segment of 
about 240 kb between cRM33 and cRM36, and that 
of Ad-386/SV40 in a DNA segment of about 100 kb 
between cRM98 and cRM133. In conclusion, these 
results indicated that the chromosome 12 break- 
points of most (5 out of 7) of the cell lines are 
dispersed over the 445 kb genomic region on the 
long arm of chromosome 12 between cRM33 and 



cRM98. It is important to note already here that 
precisely this region was recently shown to contain 
the chromosome 12q breakpoints in cell lines de- 
rived from primary uterine leiomyomas (see Fig. 3) 
and was therefore designated ULCRlZ (Schoen- 
makers et al., 1994b). As this segment of the long 
arm of chromosome 12 is involved in at least two 
types of solid tumors (Schoenmakers ec ah, 1994b; 
this study) and, as we will show below, also in a third 
solid tumor type, we will from now on refer to the 
DNA interval between cRM36 and cRM103 as 
MAR (multiple aberration region). 

FISH Mapping of I2q Breakpoints In Primary 
Pleomorphic Salivary Gland Adenomas 

Our FISH studies on metaphase chromosomes 
of pleomorphic adenomas of the salivary glands 
presented so far were restricted to cell lines derived 
from primary tumors. Although it is reasonable to 
assume that the chromosome 12 breakpoints in 
cell lines are similar if not identical to the ones in 
the corresponding primary tumors, differences as a 
result of the establishment of cell lines or subse- 
quent cell culturing cannot fully be excluded. 
Therefore, we have investigated whether the chro- 
mosome 12 breakpoints in three primary salivary 
gland adenomas were mapping to MAR as well. To 
test this possibility, a combination of cosmid clones 
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cRM33 and cRMIOj was used as a molecular 
probe. In all three cases, this cosmid pool clearly 
spanned che chromosome 12 breakpoints (data not 
shown), indicating that these breakpoints were in- 
deed localized within MAR. In a recent study 
(Waruchura ec al., submitted tor publication), it 
was reported chat the chromosome 12 breakpoints 
of five primary uterine leiomyomas with I Zq 14-1 5 
aberrations were all found to cluster within the 1.5 



Mb DN r A fragment (between cRM33* and 
cRMl03), which is known co harbor che break- 
point of various cell lines derived from primary 
uterine leiomyomas (schematically summarized in 
Fig. 5 j. Consistent with the results of che break- 
pom: mapping studies using cell lines, the results 
with the two primary solid tumor types establish 
that the breakpoints of the primary tumor cells are 
also located in MAR. 
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Chromosome Segment I2ql3-ql5 Breakpoints of 
Upomas Mapping Within MAR 

To test the possibility chat the chromosome 12 
breakpoints of other" solid rumors with 12ql3-ql5 
aberrations also mapped within MAR, we studied 
two lipomas cell lines by FISH analysis— Li-14/ 
SV40 and LU538YSV40. The chromosome 12 aber- 
rations of these two lipoma cell lines are given in 
Table 1. As molecular probe*, cosmid clones cRM33, 
cRM53. cRM72, cRM76, cRM99, cRM103, and 
cRM133 were used. The breakpoint of Li-14/$V40 
was mapped co the 75 kb DNA interval between 
KM76 and RM133. and chat of U-538/SV40 co the 
90 kb interval between RM76 and RM99 (data not 
shown), as schematically iltuscraced in Figure 3. 
Similar FISH analysis of two primary lipomas using 
a mixture of cRM36 and cRM103 as molecular 



probe resulted in a hybridization pattern indicating 
that the mixture of probes detected sequences on 
either side of the breakpoints. These results are the 
first indications that also in lipoma, chromosome 
12ql3-ql5 breakpoints occur chat map within 
MAR. More lipoma cases should be tested to allow 
proper interpretation of this observation. 

DISCUSSION 

In this study, we have mapped the chromosome 
12 breakpoints of three primary pleomorphic sali- 
vary gland adenomas as well as seven established 
cell lines derived from such tumors. All break- 
points appeared to be located in a previously mo- 
lecularly cloned and characterized chromosome 
DNA segment on the long arm of chromosome 12, 
about 1.7 Mb in size, with five of them clustering 
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in a DNA interval of less than 500 kb. The 1.7 Mb 
DNA region apparently contains a major break- 
point cluster region for this type of tumor. In a 
previous study, we have described the character- 
ization of the chromosome 12 breakpoint of pleo- 
morphic salivary gland adenoma cell line Ad -3 12/ 
SV40 (Kools et al., 1995). The breakpoint of this 
cell line is now known co map at a distance of more 
than 2 Mb distaily to this major breakpoint cluster 
region reported here. It is possible that the Ad-312/ 
SV40 breakpoint involves other pathogenerically 
relevant genetic sequences than those affected by 
the clustered breakpoints. However, the possibil- 
ity should not yet be excluded that all the 12ql3- 
ql5 breakpoints in pleomorphic salivary gland ad- 
enomas mapped so far belong to the same category 
and are dispersed over a relatively large DNA re- 
gion of this chromosome, reminiscent of the llql3 
breakpoints in B-cell malignancies (Raynaud et al., - 
1993). More precise pinpointing of the various 
breakpoints could shed more light on this matter. 

Of importance is the observation that the DNA 
segment that harbors the clustered 12q breakpoints 
of pleomorphic salivary gland adenomas appears to 
coincide with the DNA region that was recendy de- 
fined as the uterine leiomyoma cluster region of 
chromosome 12 breakpoints, known as ULCR12 
(Schoenmakersetal., 1994b). Of further interest is 
the fact that this region of chromosome 12 also har- 
bors breakpoints of primary lipomas and lipoma cell 
lines derived from primary tumors with 12ql3-ql5 
aberrations. Altogether, the results of all these stud- 
ies now clearly demonstrate that chromosome 12 
breakpoints of three distinct solid tumor types map 
to the same 1.7 Mb genomic region on the long arm 
of chromosome 12, establishing this region to be a 
multiple aberration region. To reflect this character- 
istic, we have designated this DNA segment MAR. 

Genetic aberrations involving chromosomal re- 
gion X2ql3-ql5 have been implicated by many cy- 
togenetic studies in a variety of solid tumors other 
than the three already mentioned. Involvement of 
12ql3-ql5 has also been reported for endometrial 
polyps (Walter ec ai., 1989; Vanni et al., 1993), 
clear cell sarcomas characterized by recurrent t(12; 
22)(ql3;ql3) (Fletcher, 1992; Reeves etal.. 1992; 
Rodriguez et al., 1992), a subgroup of rhabdomyo- 
sarcomas (Roberts et al., 1992) and hemangioperi- 
cytoma (Mandahl etal., 1993a), chondromatous tu- 
mors (Mandahl et ai., 1989; Bridge et al., 1992; 
Hirabayashi et al., 1992; Mandahl et al., 1993b), 
and hamartoma of the lung (Dal Cin et al., 1993). 
Finally, several case reports of solid tumors with 
involvement of chromosome region I2ql3-ql5 
have been published — e.g., tumors of the breast 



(Birdsai et al., 1992; Rohen et al., 1993), diffuse 
astrocytomas (Jenkins et al., 1989), and a giant-cell 
tumor of the bone (Noguera et al., 1989). On the 
basis of results of cytogenetic studies, no predic- 
tions could be made about the relative distribution 
of the breakpoints of these tumor types. In light 
of the results of the present study, it would be of 
interest to see whether the breakpoints of any of 
these solid tumors also map within or dose to 
MAR. The various cosmid clones available now 
provide the means to test this readily. 

The observation that 12q breakpoints of at least 
three different types of solid tumors map to the 
same DNA region is intriguing as it could be point- 
ing towards the possibility that the same generic 
sequences in MAR are pathogenerically relevant for 
tumor development in different tissues. If so, it is 
tempting to speculate that the gene(s) affected by 
the genetic aberrations might be involved in growth 
regulation. On the other hand, one cannot yet ex- 
clude the possibility that genetic sequences in MAR 
are not pathogenerically relevant, as the observed 
clustering of genetic aberrations in MAR could sim- 
ply reflect genetic instability of this region, which 
becomes apparent in various solid tumors. To ob- 
tain more insight into this matter, the genes residing 
in MAR should be identified and characterized, and 
this can be achieved by various approaches using 
several techniques (Parrish and Nelson, 1993). 
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Identification of the Chromosome 12 Translocation 
Breakpoint Region of a Pleomorphic Salivary Gland 
Adenoma with t(l;12)(p22;ql5) as the Sole Cytogenetic 
Abnormality 

Patrick F. J. Kools, Sylke Wanschura, Eric F. P. M. Schoenmakers, 
Jan M. W. Geurts, Raf Mols, Bernd Kazmierczak, J5rn Bullerdiek, 
Herman Van Den Berghe, and Wun J. M. Van de Ven 



ABSTRACT: Cell Jine Ad-312fSV40, which was derived from a primary pleomorphic salivary gland ade- 
noma with t(l;U)(p22;qisi was used in fluorescence in situ hybridization (FISH) analysis to characterize 
its translocation breakpoint region on chromosome 12. Results of previous studies have indicated that 
the chromosome 12 breakpoint in M-TL2/SV40 is located proximally to locus D12S6 and dlstally to the 
CHOP gene. Wis here describe two partially overlapping yeast artificial chromosome C&Q clones, Y4654 
(500 kbp) and Y9091 [460 kbp), which we Isolated in .the context of a chromosome walking project with 
D12S8 and CHOP as starting points. Vk present a composite long-range restriction map encompassi n g 
the inserts of these two YAC clones and show by* FISH analysis that both YAC* span the chromosome 12 
breakpoint as present In M-TL2JSV40 cells. Subsequently, we have Isolated cosmid clones corresponding 
to various sequence-togged sites (STSs) mapping within the inserts of these YAC clones. These included 
cRMSl, cRM&% cBMSS, cBM90, cRM9t cRMXIO, and cRMlll. In FISH studies, cosmid clones cHMSS, cRM9Q, 
and cHMlll appeared to map distall J* to the chromosome 12 breakpoint, whereas cosmid clones cRMSl, 
cBM89. cRM9l. and cRMllO were /bund to map proximally to it These results assign the chromosome 
12 breakpoint in M-3WSV40 to a DSA region of Its than 165 kbp. FISH evaluation o/the chromosome 
12 breakpoints in fin other pleomorphic salivary gland adenoma cell lines indicated that these are lo- 
cated proximally to the one in Ad*3l2/SV40, at a distance of more than 0J9 Mbp from STS AMOL These 
results, while pinpointing a potentially critical region on chromosome 12, also provide evidence for the 
possible involvement of 12qi3-ql5 sequences located elsewhere. 



INTRODUCTION 

.eomorphic salivary gland adenoma constitutes a benign 
epithelial tumor that originates from the major and minor 
salivary glands. It is the most common type of salivary gland 
tumor and accounts for almost 50% of aU neoplasms in these 
organs; 85% of the tumors are found in the parotid gland. 
10% in the minor salivary glands, and 5% in the subman- 
dibular gland [1]. About 50% of these adenomas appear to 
have a normal karyotype but cytogenetic studies have also 
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revealed recurrent specific chromosome tnrrmtHes [2. 3]. 
Frequently observed inomilies include aberrations of chro- 
mosome 8, usually involving the {k(12-<jl3 region, with the 
most common aberration being a t(3;8)(p2fcqi2}» and aber- 
rations of chromosome 12, usually translocations involving 
region 12q13-ql5. Non-recurrent clonal chmmnsmne abnor- 
malities have also been reported. The highly specific pattern 
of chromosome r^ T * p * T *g* m * wt * with consi stent breakpoints 
at 8ql2-ql3 and 12ql3-ql5 suggests that these chromosomal 
regions harbor genes that might be implicated in the devel- 
opment of these tumors. Molecular cloning of the chromo- 
some breaVpointt and rr hir** fl yri?» rtflin «f A«ir jimertan*»g» 
ments may lead to the identification of pathoganstlcaBy 
relevant genes. At present, no such molecular data have yet 

been reported for these tumors. 

On the basis of fluorescence in situ hybridization (FISH) 
data, the chromosome 12 breakpoints in six pleomorphic sal- 
ivary gland adenoma cell lines were recently shown to be 
mapping to region 12ql3-ql5, more precisely, to the genomic 
interval between loci D12S19 and D12Sa [4, 5]. The sex- 
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Figurel Composite physical map of the ovarhpping DNA bsetts of clone* Y4B34 and Y909L Staas of the 
DNA inserts are Indicated. The relative positions of the YAC clones ere represented by bars below tl» long-ranfe 
physical map. Sequsoce-tiaed sites (SISs) earrtspondini to end-done* otWCM. iadudmg YAQ sot shown harm, 
are Indicated by bond RM codes above tha restriction map. STSs obtained &om inter-Alu-PCR products set given 
below the restriction map and the DNA regions to which they haw been mspi>td art marked by snows. B: BsmHH; 
Mi hOtih Pi Pvul; Sfe S/H. A polymorphic Mini stt* is marked by an • 
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averaged genetic size of this genomic DMA. interval was 
reported at HGM10 to be 7 cM [8j. We also reported that the 
chromosome 12 breakpoints In salivary gland adenomas map 
distally to tha CHOP gene [5J, which supports an earlier study 
indicating that the 12ql3-ql5 translocation breakpoints in 
pleomorphic salivary gland «dimnm» T ^ different from that 
In myxoid Uposarcoma [7]. Here, we report the physical map- 
ping of the chromosome 12 breakpoint in pleomorphic sail* 
vary gland adenoma cell line Ad-312/SUft& which carries a 
tfrl2)(p22;qiS) as the only cytogenetic abnormality. 

MATERIALS AND METHODS 
Tumor Cell Uses 

H um an tumor cell lines used in this study included the pre- 
viously described pleomorphic salivary gland adenoma cell 



lines Ad-248/5V40, Ad-2S3/SV4Q, Ad-295/SVtO, Ad-302/SVftO, 
Ad-312/SV40. and Ad-3M/SV40 [5, B]. Calls ware cultivated 
in 1Q99 culture medium with Eaxle*s salts sur^lemanted 
with 20% fetal bovine serum. Other cell lines used la this 
study included somatic cell hybrid PKB9-12. which contains 
chromosome 12 as the sole human chromosome in a ham- 
ster genetic background [9J, and somatic cell hybrid US-3/ 
SV4Q/A9-B4 [4]. The latter cell line was obtained upon fu- 
sion of the myxoid Uposarcoma call line US-4/SV4Q, which 
carries the specific t(12;lBHql3;pll.2). with mouse A9 
This somatic cell hybrid was previously shown to contain 
dar(16) but ti ft i th wr der(12) nor the normal chrornflffrnne 12 
[4]. PKB9-12 and US-3/SVWA9-B4 cells were grown in DMB- 
F12 mmifiim supplemented with 10% fetal bovine serum. Cell 
lines were analyzed by standard cytogenetic at 
regular intervals. 
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Chromosome 12 Breakpoint of a Salivary Claud Adenoma 

s Isolation of YAC and Cosmid Clones 

In the context of human genome mapping studies, which 
will be described in detail elsewhere (Schoenmaken et aL, 
in preparation), we isolated YAC clones Y4854 and Y9091 
from the first-generation CEPHYAC library [10], and cosmid 
clones cRMSl. cRM69, cRM85. CRM90, cRM91. CRM103, 
cRMHO, and cRMlll from the chromosome-12-spedfic ar- 

10 rayed cosmid library LLNLNCOl (ill. YAC and cosmid clones 

ivere isolated as described before [5], Initial screenings of 
the YAC, as well as the cosmid library, were performed using 
a screening strategy involving the polymerase chain reac- 
tion (PCR) [12]. Filter hybridization analysis was used as the 
final screening step* as previously described [5]. Cosmid 

75 clones were isolated using STSs and those corresponding 

to STSs within the inserts of YAC clones Y4834 tad Y9091 
are indicated In Figure l. STSs were obtained via rescue of 
YAC insert end-sequences using a vectorette-PCR procedure 
(13)- or Alu-PCR [14. 1SJ. PCR products ware sequenced 
directly via- solid-phase fluorescent sequencing. Cosmid 

20 clones were grown and handled according to standard procu- 

res [IB]. YAC clones were characterised by pulsed-field 
gel electrophoresis [17], restriction mapp log, and hybridiza- 
tion, as previously described [5]. 

Chromosome Preparations and Fluorescence 
in Situ Hybridization 

25 Cells from the pleomorphic salivary gland adenoma tumor 

cell lines were treated with Colcemid (004 ugfal) lor 30 min 
and then harvested according to routine methods. Metaphase 
spreads of the tumor cells were prepared as described be- 
fore [4], lb establish the identity of chromosomes in the FISH 
experiments* FISH analysis was performed alter G-banding 

30 of the same metaphase spreads. G-banding was performed 

essentially as described by Smit et al. [18]. In situ hybridiza- 
tions were carried out according to a protocol described by 
Kievits et al. [19] with some minor modifications [5, 20}. 
Cosmid and YAC DNA was labeled with biotin-ll-dUTP 
(Boehringer Mannheim) or biotln-14-dATP (BRL, Caithars- 

2£ burg), as described earlier [3] . Chromosomes were counter- 

stained with pro pid turn iodide and analyzed on a Zeiss Ax- 
iphot fluorescence microscope using a FITC filter (Zeiss), 
.results were recorded on Scotch (3M) 64QASA film, 



RESULTS 

Isolation and Characterization of YAC Clones Spanning 
the Chromosome 12 Breakpoint of Pleomorphic 
Salivary Gland Adenoma Cell Line Ad-312/SV40 
In previous studies [5], we mapped the chromosome 12 break- 
points of six pleomorphic salivary gland adenoma cell Unas 
proximally to locus D12S8 and distally to CHOP. The DNA 
interval between these loci is somewhat smaller than 7 cM 
(estimated distance between the loci 012S8 and D12S19 [8]) 
but still substantially large. To molecularly define the trans- 
location breakpoint of Ad-3U/SV40. we have performed hu- 
man genome mapping studies on the DNA interval between 
locus D12S8 and the CHOP gene. In the process of directional 
chromosome walking starting from D12S8 and the CHOP 
gene, we obtained overlapping YAC clones Y9091 and Y48S4. 



The DNA insert of Y9091 appeared to be 460 kbp and that 
of Y4854. 500 kbp. Moreovet is we will demonstrate below, 
the DNA insert of each YAC clone appeared to span the chro- 
mosome 12 breakpoint of Ad-312/5V40. A long-range restric- 
tion map of the inserts of these YAC clones was made using 
pulsed-field gel electrophoresis and hybridization analysis 
(Fig. l). On the basis of STS content mapping and Southern 
blot analysis, the inserts of YAC clones Y9091 and Y48S4 ap- 
peared to overlap, as indicated in Figure L The tested STSs 
correspond to end-sequences of other overlapping YAC clones 
not shown here or to sequences obtained via inter-Alu-PGL 
Of these, RM90 and RM91 represent such end-clone STSs 
of YAC Y9091, and RM48 and KM54 of Y4854. whereas RMllO 
and RMlll represent STSs derived from mter-Alu-PCR. Rjr 
a number of STSs mapping within the inserts of YAC clones 
Y4854 and Y9091, corresponding cosmid clones were iso- 
lated for use in FISH analysis, ag.. cRMSt cRM69. cRMSS, 
cRM90, CRM91, cRMllO, and cRMlll. 

The inserts of the two overlapping YAC clones are most 
likely not chimeric as was deduced from the following ob- 
servations. FISH analysis of metaphase chromosomes of nor- 
mal human lymphocytes with Y4854 or Y9091 DNA u 
molecular probe revealed hybridization signals only in chro- 
mosome region 12ql3-ql5. For Y9G91. this was confirmed fur- 
ther by observations made In FISH studies In which cosmid 
clone cRMSO or cRMdl was used as probe; the DNA insert 
of each of these two cosmids corresponds to the alternative 
end-sequences of YAC done 1ft09L Finally, the end-sequence 
STSs of Y9091 appeared to map to chromosome 12 and dis- 
tally to the CHOP gene, as was established by PCR analysis 
on PK89-12 DMA, which contains human chromosome 12 
as the sole human chromosome in a hamster genetic back- 
ground, and US-3/SV40/A9-B4 DNA, which was previously 
shown to contain der(18). from the specific t(12;13) of myx- 
oid Uposarcoma, but neither der(12) nor the normal chro- 
mosome 12 [4]. From the chromosome walking studies, we 
concluded that the overlapping inserts of the two YAC clones 
represent a DNA region of about 340 kbp, which is located 
on chromosome 12q between D12S3 and CHOP. As the 840- 
kbp composite long-range restriction map of the YAC condg 
was constructed with at least double coverage of the entire 
region, it is not unreasonable to assume that the 840-kbp 
region is contiguous with the chromosomal DNA. although 
microdeletions cannot be excluded at this point 

Chromosome walking was routinely evaluated by FISH 
mapping of YAC clones or cosmid clones corresponding to 
YAC insert sequences. It should be noted that for the iden- 
tification of diromosomas. G-banding was used m most cases. 
On the basis of such G-banding, hybridization signals could 
be assigned conclusively to chromosomes of known iden- 
tity; this was also of importance for the cases with cross- or 
background hybridization signals that were occasionally ob- 
served. G-banding prior to FISH analysis sometimes resulted 
in rather weak hybridization signals or rather vague band- 
ing patterns. Therefore, we performed FISH experiments in 
which the YAC and cosmid clones to be evaluated were 
used in combination with a reference probe. Cosmid done 
cPKl2qter, which was serendipitously obtained during 
screening of a cosmid library, was selected as reference 
marker. FISH analysis of metaphase chromosomes of nor- 
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Figure 2 A) Mapping of cos mid clone cPKl2qt»r to the telomerfc 
region of the long arm of chromosome 12. Centromere 12-spedfic 
probe pal2H8 was used to establish the identity of chromosome XZ. 
FISH analysis was performed on meta phase chromosomes of con- 
trol human lymphocytes. Hybridization signals of ePKUqter an 
marked with small arrowheads, those of the centromere 12-specific 
probe with asterisks. B. C) FISH analysis of metaphase chromosomes 
of Ad-3;2.'SV40 cells using DNA of YAC clone Y4854 (B) or 
(C) as molecular probe in combination with cosm id clone cFKQqter 
as reference marker. Hybridization signals of the YAC clones on duo- 



Figure 3 FISH analysis of metaphase chromosomes of Ad-JQ/SW 
cells using DNA of cosmid clone cRM89 (A), or cRMUl (B) «smo- 
lacular probe in combination with cosmid clone cPKl2qttrs*rsfe» 
ence marker. The position of the hybridization signals of cPlCOqtar 
are indicated by small arrowheads. In (A), the position of the hy- 
brid trat ion signal of cRM69 on normal chromosome 12 is Inillfafeil 
by a large arrowhead, and that on der(l2) with a small arrow. In fB). 
the position of the hybridization signal of cRMUl on normal dm> 
mosome 12 Is Indicated by a large arrowhead, and that on da*tl) 
with a targe arrow. 



mal lymphocytes (Fig. 2A) revealed that cPKtfqter maps to 
the telomeric region of the long arm of chromosome) 12. lb 
identify chromosome 12 in this experiment, centromere 12- 
specific probe pal2H8 [21] u-as used. FISH analysis of 
metaphase chromosomes of Ad-3i2/SV«w cells using YAC 
clone Y4854{Fig. 2B) or Y9091 {Fig. 2C) in combination with 
reference probe cPKl2qter revealed, in both cases, hybrid- 
ization signals of the YAC insert on der(l) as well as der(12). 
We concluded from these results that the insert DNA of each 



mosorp.e 12 are indicated by targe arrowheads: those on derfl) by 
large arrows, and those on der1i2) b>- small arrows, respectively The 
hybridization signals of cosmid clone cPKUqter are indicated by 
small arrowheads. 
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Figure 4 Sehtooatlc representation of FISH mapping data obtained for to six plaoc^hfc salitiry f>iri ade- 
noma c»ll Uha* tm**A tn fhu «tn/4 y Th« «j^Hft<- ^mi^m^ 17 ibtmtf?os tn tha Tiiimu call lima ira gfrra. Cos 
mid donas which war* usad as probes la the FISH mapping stadias correspond to saqoanca-taggsd sitaa obtained 
from overlapping YAC clouts. Individual FISH exparunants an Indicatad by dots. Coemtt donas were named after 
the acronynu of the 5I5a>u shown utbebo^ 

between RM90 and RMHO ti tstlmatad to b« ahotii i_a Mty. Tn«m+- Sz-htri— <«> > . r > ... n»«rt^ rrfth« r pi—J^ d*r f»- 
tiva chrom o so m es dsrfl aad derttt) of tha Ad-312/SVlO call Una, which carries a t(Ul2Mp2fcalS). Tea positions of 
the chromosoma U braakpolati of Ad-2ia/SV*a Ad-283J5Vta, Ad-293/SVtQ, Ad-302/SVtO, ind Ad«3S6/SVio an distal 
to RM103 as indicated by the arrow. 



YAC clone might span tha chromosome 12 breakpoint in *!Hf 
call Una. It should bo noted that G-banding revealed a telo- 
merlc association involving tha short arm of chromosoma 
12 In Figure 2C. The observition that YAC done Y9091 
spanned the chromosome 12 breakpoint inAd-312/SV40waa 
>nflrmed independently in FISH studies In which eosmld 
clone cRM90 or cRM91 was used as molecular probe; they 
ware shown to contain tha alternative end-sequences of the 
Y9091 insert. cRM90 appeared to map dlstaHy to the chro- 
mosome 12 breakpoint whereas cRM9l was found to map 
proximally (data not shown). These results also established 
the chromosomal orientation of the YAC conUg shown In Fig- 
ure 1. In summary, we concluded from these FISH studies 
that the chromosome 12 translocation breakpoint in Ad- 
312/SV40 must be located in the DNA interval correspond- 
ing to the overlapping sequences (about 300 kbp) of the two 
YAC clones. 



Fine Mapping of the Chromosome 12 Translocation 
Breakpoint of Ad-312/SV40 

In an approach to further narrow the chromosome 12 trans- 
location breakpoint region of Ad-312/SV4a cosmid clones 
with different mapping positions within YAC clone Y9091 
were isolated. These included cRM69. cRMoS. cRMlia and 



cRMlll. cRM69 and cRMSS were isolated on the basis of SIS 
sequences of YAC donas not shown here. cRMllO and 
cRMlll were obtained via inter-Alu-FCR. RM110 was shown 

by Srarthrn h\d analyst tn hyhrldtea tn m trmtwl MlnT frf- 

must of YS091 end not to the DMA insert of the overlapping 
YAC clone with RM69 astelomeric end-sequences. The lo- 
cation afRMllO is as indicated m Figure LRMm was shown 
to hybridize to a BssHII. Mlul. Pvul, and S/H fragment of 
Y9091 and is therefore located tn the Pvul-Sfil fragment of 
Y9091, to which STS RMiS was also mapped (Fig. 1). FISH 
analysis of metaphtse chromosomes of Ad-J12/SViO with 
cRMSQ or cRMllO as probe indicated that the DHA insert 
of these <**wiwMf mapped pradmally to the r h wfiTiftffline 12 
txarislocation breakpoint in this call line, as illustrated for 
cRMB9 tn Figure 3A. Subsequent FISH analysis of Ad- 
312/SV40 with cRMflS or cRMlll as probe revealed hybrid- 
ization signals distally to the translocation breakpoint, as 
illustrated for cRMlll in Figure 3B. Tha results with cRMSS 
and cRMlll are in agreement with the observed breakpoint 
spanning by YAC done Y4554. as cRMSS maps distally and 
cRMlll closely to STS RM4S, which marks thetalomerieend 
of the YAC clone Y4&54. In cosdusion. tha chromosome 12 
translocation breakpoint in M-312/5V40 must be located m 
the DNA interval between cRMllO and cRMlll. as summa- 
rized schematically in Figure 4. 
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FISH Evaluation of Chromosome 12 Breakpoints in 
Other Pleomorphic Salivary Gland Adenoma 
Cell Lines 

Tb determine the position of their chromosome 12 break- 
points relative to that of Ad-3U/SV*4Q, five other pleomorphic 
salivary gland adenoma cell lines were evaluated by FISH 
analysis, as summarized schematically In Figure 4. These 
cell lines, which were developed from primary tumors (5, 
a], included Ad-248/SV40. Ad-263/SV40, Ad-295/5V40, Ad- 
302/SV40, and Ad-386/SV40. The chromosome 12 aberrations 
of these ceil lines are listed in Figure 4. FISH analysis of 
metaphase chromosomes of these cell lines using cRM91 re- 
vealed that the chromosome 12 breakpoints of all these cell 
lines mapped prcodmally to this cosmid clone (data not 
shown). Similar FISH analysis wis also performed using a 
cosmid clone corresponding to sequence-tagged site RM303 
as a probe. RM103 was found to map pnscimally to RM91 
at a distance of about 04 Mbp. In ail cues. cRMlOS appeared 
to map distally to the chromosome 12 translocation break- 
points, indicating that the chromosome 12 breakpoints in 
fK *se five pleomorphic salivary gland adenoma cell lines are " 

ated at a relatively large distance from that of Ad-312/5V40 
cells. 



DISCUSSION 

In the studies presented here, we have identified, molecu- 
laxly cloned, and characterized & chromosome region on the 
long arm of chromosome 12 in which the translocation break- 
point of pleomorphic salivary gland adenoma cell line Ad/ 
312/SV40 appears to map. In previous studies [5]. we already 
provided evidence that the chromosome 12 breakpoint of this 
cell line was located between D12S8 and CHOP. Becausa the 
two breakpoints spanning YAC clones described here were 
obtained in directional chromosome walking experiments 
using DttSa and the CHOP gene as initial starting points, 
the chromosome 12 breakpoint mapping results presented 
here confirm our previous claim. The FISH results obtained 
with the complete YAC insert of Y9091 as molecular probe 
were confirmed independently in FISH studies using cos- 
mid clones containing sequences corresponding to various 

Ions of the insert of this YAC done. This is of importance, 
as the independent confirmatory results make it rather un- 
likely that the split signals observed with the complete in- 
sert of Y3091 can be explained otherwise than by a factual 
splitting of sequences represented in the YAC The presence, 
for instance, of highly related genetic sequences on both sides 
of a chromosome breakpoint could easily lead to erroneous 
conclusions if they were based solely on. FISH results of a 
YAC insert. Finally, our mapping studies have also estab- 
lished conclusively the chromosomal orientation of the long- 
range restriction map we have generated in these studies. This 
orientation was already predicted on the basis of two-color 
FISH studies (unpublished observations 1. 

The FISH studies, described here, enabled us to map the 
chromosome 12 breakpoint in Ad7312/SV40 cells to the 190- 
kbp DNA interval between the established STSs RM48 and 
RM69. However the breakpoint region can be narrowed some- 
what further on the basis of the following. The feet that Y4854 
was shown to span the breakpoint indicates that at least a 



P. F. I. Kools at aL 

considerable part of the telomeric half of the YAC clone most •> 
map distally to the breakpoint Precisely how much remains 
to be established. On the other side. STS RM69 appeared 
to be located In about the middle of the DNA Insert of cos- 
mid clone cRM89. suggesting that the breakpoint is close 
to 25 kbp distally to RMB9. Moreover. cRM69 appeared to 
lackRMllQ (data not shown) and. as cRMllO was found pca»» 
irnally to the chromosome 12 breakpoint in Ad/3l2/SV40 cells, 
the breakpoint should be even further distal to RM69 than 
the earlier-mentioned 25 kbp. Altogether, this narrows the 
chromosome 12 breakpoint region to a DNA interval, which 
must be considerably smaller than 155 kbp. Further pinpoint- 
ing of the breakpoint wilt allow us to molecularly clone the 
chromosome 12 breakpoint and to characterize the genetic 
sequences in the breakpoint junction region, which might 
lead to the Identification of pathogenedcally relevant se- 
quences. Identification of the genes present in the DNA In- 
serts of YAC clones Y4B54 and Y9091. via sequencing, direct 
hybridization, direct selection, or ron-trepping, might con- 
stitute a useful alternative approach for identifying the gene 
in this region of the long arm of chromosome 12 that might 
be pathogenetically critical for pleomorphic salivary gland 
adenoma tumorigenesxs. 

The observation that the chromosome 12 breakpoints In 
other pleomorphic salivary gland adenomas axe located in 
a remote and more proximal region on the long arm of chro- 
mosome 12 is of interest It could imply that the chromo- 
some 12 breakpoints in pleomorphic salivary gland ade- 
nomas are dispersed over a relatively large DNA region of 
the long arm of chromosome 12, reminiscent of the llqS3 
breakpoints in B-call malignancies [22]. Elucidation of the 
precise location of the chromosome 12 breakpoints in the 
other pleomorphic salivary gland adenoma cell lines could 
shed more light on this mattec On the other hand, it could 
point towards alternative sequences on the long arm of chro- 
mosome 12 between D12S8 and the CHOP gene that might 
be of importance, presumably for grewm regulation in pleo- 
morphic salivary gland adenoma. The fact that the chromo- 
some 12 breakpoint region described here has so for been 
found only in the Ad-312/SViOceil line makes it necessary 
to analyze a larger number of salivary gland adenomas with 
chromosome 12qtt~ql5 aberrations to assess the potential rel- 
evance for tumorigenesis of the chromosome 12 se quences 
affected in the studied cell line, if more cases with aberra- 
tions in this particular region of chromosome 12 can be 
found, itwouldbeof iiiteresttonndotttwhethwthw 
form a clinical subgroup. Finally, chromosome tnniloratfrms 
involving region qG~ql5 of human chromosome 12 have been 
reported for a variety of other solid tumors: benign adipose 
tissue tumors, uterine leiomyoma, rhabdomyosarcoma, hem- 
angiopericytoma, clear-cell sarcoma, chondromatous tumors, 
and hamartoma of the lung. Whether or not the chromosome 
12 breakpoints in some of these tumors map within the same 
region as that of Ad-312/SV40 remains to be established. The 
YAC and cosmid clones described in this report constitute 
useful tools for investigating this. 

The availability of a copy of the nist-ttntimtfoa CEPH YAC Ubnry 
(ID) and a copy of ma aneyed ehcoaofoma-ia^ptcmc cosmid Ufansy 
(LLNU2NC0TJ [111 U greatly sckncwladgtd. The cos m id Ubi ety was 
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Chromosome 12 Breakpoint of a Salivary Gland Adenoma 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: K.U. Leuven Research & Development 

(B) STREET: Benedenstraat 59 

(C) CITY: Leuven 

(E) COUNTRY: Belgium 

(F) POSTAL CODE (ZIP) : B-3000 

(G) TELEPHONE: — 

(H) TELEFAX: — 

(I) TELEX: — 

(ii) TITLE OF INVENTION: Multiple-tumor aberrant growth genes 
(ill) NUMBER OF SEQUENCES: 4 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disfc 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0 , Version #1,25 (EPO) 

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: EP 95201951.1 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1033 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(G) CELL TYPE: lipoma 

(H) CELL LINE: Li-501/SV40 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: pCHlll 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/ SEGMENT: 12+3 

(B) MAP POSITION: 12ql5+3q27-q28 
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(Ix) FEATURE: 

(A) NAME /KEY: COS 

(B) LOCATION: 3.. 1031 

5 (D) OTHER INFORMATION: /partial 

/c don_start= 3 

/product 5 * "fusion gene HMG and LPP" 
/label- HMG-LPP 

(ix) FEATURE: 
10 (A) NAME/ KEY: CDS 

(B) LOCATION: 3.. 182 
(D) OTHER INFORMATION: /partial 
/codon_start— 3 

/product- "part of High Mobility Group protein" 
/label- HMG 

75 

(ix) FEATURE: 

(A) NAME/KEY: COS 

(B) LOCATION: 183.. 1031 

(D) OTHER INFORMATION: /partial 
/codon_start* 183 

20 /product= "part of Lipoma -Preferred Partner gene" 

/label- LPP 

(ix) FEATURE: 

(A) NAME/ KEY: primerjbind 
^ (B) LOCATION: 1..18 

(ix) FEATURE: 

(A) NAME/KEY: primer bind 

(B) LOCATION: 1000.. 1033 

30 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CG CTT CAG AAG AGA GGA CGC GGC CGC CCC AGG AAG CAG CAG CAA AAA 47 
Leu Gin Lys Arg Gly Arg Gly Arg Pro Arg Lys Gin Gin Gin Lys 
15 10 15 

35 CCA ACC GGT GAG CCC TCT CCT AAG AGA CCC AGG GGA AGA CCC AAA GGC 95 

Pro Thr Gly Glu Pro Ser Pro Lys Arg Pro Arg Gly Arg Pro Lys Gly 
20 25 30 

AGC AAA AAC AAG AGT CCC TCT AAA GCA GCT CAA GAG GAA GCA GAA GCC 143 
Ser Lys Asn Lys Ser Pro Ser Lys Ala Ala Gin Glu Glu Ala Glu Ala 
AO 35 40 45 

ACT GAA GAA AAA CGG CCA AGG GGC AGA CCT AGG AAA TGG GGT GGC CAT 191 
Thr Glu Glu Lys Arg Pro Arg Gly Arg Pro Arg Lys Trp Gly Gly His 
50 55 60 

45 TCA GGG CAA CTG GGG CCT TCG TCA GTT GCC CCT TCA TTC CGC CCA GAG 239 

Ser Gly Gin Leu Gly Pro Ser Ser Val Ala Pro Ser Phe Arg Pro Glu 
65 70 75 

GAT GAG CTT GAG CAC CTG ACC AAA AAG ATG CTG TAT GAC ATG GAA AAT 287 
Asp Glu Leu Glu His Leu Thr Lys Lys Met Leu Tyr Asp Met Glu Asn 
50 80 85 90 95 
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10 



15 



20 



25 



CCA CCT GCT GAC GAA TAC TTT GGC CGC TGT GCT CGC TGT GGA GAA AAC 335 
Pro Pro Ala Asp Glu Tyr Phe Gly Arg Cys Ala Arg Cys Gly Glu Asn 
100 105 no 

GTA GTT GGG GAA GGT ACA GGA TGC ACT GCC ATG GAT CAG GTC TTC CAC 383 
Val Val Gly Glu Gly Thr Gly Cys Thr Ala Met Asp Gin Val Phe His 
115 120 125 

GTG GAT TGT TTT ACC TGC ATC ATC TGC AAC AAC AAG CTC CGA GGG CAG 431 
Val Asp Cys Phe Thr Cys lie He Cys Asn Asn Lys Leu Arg Gly Gin 
130 135 140 

CCA TTC TAT GCT GTG GAA AAG AAA GGA TAC TGC GAG CCC TGC TAC ATT 479 
Pro Phe Tyr Ala Val Glu Lys Lys Ala Tyr Cys Glu Pro Cys Tyr He 
145 150 155 

AAT ACT CTG GAG CAG TGC AAT GTG TGT TCC AAG CCC ATC ATG GAG CGG 527 
Asn Thr Leu Glu Gin Cys Asn Val Cys Ser Lys Pro He Met Glu Arg 
160 165 170 175 

ATT CTC CGA GCC ACC GGG AAG GCC TAT CAT CCT CAC TGT TTC ACC TGC 575 
He Leu Arg Ala Thr Gly Lys Ala Tyr His Pro His Cys Phe Thr Cys 
180 185 190 

GTG ATG TGC CAC CGC AGC CTG GAT GGG ATC CCA TTC ACT GTG GAT GCT 623 
Val Met Cys His Arg Ser Leu Asp Gly He Pro Phe Thr Val Asp Ala 
195 200 205 

GGC GGG CTC ATT CAC TGC ATT GAG GAC TTC CAC AAG AAA TTT GCC CCG 671 
Gly Gly Leu He His Cys He Glu Asp Phe His Lys Lys Phe Ala Pro 
210 215 220 

30 CGG TGT TCT GTG TGC AAG GAG CCT ATT ATG CCA GCC CCG GGC CAG GAG 719 

Arg Cys Ser Val Cys Lys Glu Pro He Met Pro Ala Pro Gly Gin Glu 
225 230 235 

GAG ACT GTC CGT ATT GTG GCT TTG GAT CGA GAT TTC CAT GTT CAC TGC 767 
Glu Thr Val Arg He Val Ala Leu Asp Arg Asp Phe His Val His Cys 
35 240 245 250 255 

TAC CGA TGC GAG GAT TGC GGT GGT CTC CTG TCT GAA GGA GAT AAC CAA 815 
Tyr Arg Cys Glu Asp Cys Gly Gly Leu Leu Ser Glu Gly Asp Asn Gin 
260 265 270 

40 GGC TGC TAC CCC TTG GAT GGG CAC ATC CTC TGC AAG ACC TGC AAC TCT 863 

Gly cys Tyr Pro Leu Asp Gly His He Leu Cys Lys Thr Cys Asn Ser 
275 280 285 

GCC CGC ATC AGG GTG TTG ACC GCC AAG GCG AGC ACT GAC CTT TAG ATT 911 
Ala Arg He Arg Val Leu Thr Ala Lys Ala Ser Thr Asp Leu * He 
<5 290 295 300 

CAG TCA CCT GTT CAG CCG GCA CTG AGA AGA ACG AAC ACA AGA AAA AGA 959 

Gin Ser Pro Val Gin Pro Ala Leu Arg Arg Thr Asn Thr Arg Lys Arg 

305 310 315 

50 
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TAA GAA ATA CTA GAG TAA AGG CCA TCA AAC TAC GCG AAA AAA AAA AAA 1007 

* Glu lie Leu Glu * Arg Pro Ser Asn Tyr Ala Lys Lys Lys Lys 
320 325 330 335 

5 

AAA AAA AAA GAT GTC GAC GGA TCC TT 1033 
Lys Lys Lys Asp Val Asp Gly Ser 
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35 
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(2) INFORMATION FOR SEQ ID NO: 2: 

- (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

10 (ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: YES 

(iii) ANTI-SENSE: NO 

r5 (Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(F) TISSUE TYPE: small intestine 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: small intestine cDNA library (Clontech, cat. 
20 no. HL1133b) 

(B) CLONE: various clones were used 

(viii) POSITION IN GENOME: 

(B) MAP POSITION: 3q27-28 

25 (ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 247.. 2085 



30 



40 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GTCACTTTTA TTTGGGGGTG TGGACAGCTG CTTTCCCAGG GGAGTACTTC TTACAGTGGG 60 

ATTTCAAGAC AAGATCGGCC TGAAGAAAAA TTATATTTGT ATATTTTTTA AAAAGTGGGA 120 

ACTTTGAGGC TCAGAGACAG AGCAGAAGAC AGAACCTGGT CTTCTGATTC CCTGTGTTCT 180 

GCTTTTTTCA TTGTTCCACT GGACGCTCAT CAGAGGGAAG ATCTTTTTCC TCAATTGATT 240 

CCAACA ATG TCT CAC CCA TCT TGG CTG CCA CCC AAA AGC ACT GGT GAG 288 
Met Ser His Pro Ser Trp Leu Pro Pro Lys Ser Thr Gly Glu 
15 io 

CCC CTC GGC CAT GTG CCT GCA CGG ATG GAG ACC ACC CAT TCC TTT GGG 336 
Pro Leu Gly His Val Pro Ala Arg Met Glu Thr Thr His Ser Phe Gly 
15 20 25 30 

AAC CCC AGC ATT TCA GTG TCT ACA CAA CAG CCA CCC AAA AAG TTT GCC 384 
Asn Pro Ser Jle Ser Val Ser Thr Gin Gin Pro Pro Lys Lys Phe Ala 
35 40 45 

CCG GTA GTT GCT CCA AAA CCT AAG TAC AAC CCA TAG AAA CAA CCT GGA 432 
Pro Val Val Ala Pro Lys Pro Lys Tyr Asn Pro Tyr Lys Gin Pro Gly 
50 55 60 
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GGT GAG GGT GAT TTT CTT CCA CCC CCA CCT CCA CCT CTA GAT GAT TCC 480 
Gly Glu Gly Asp Phe Leu Pro Pro Pro Pro Pro Pro Leu Asp Asp Ser 
65 70 75 

5 

AGT GCC CTT CCA TCT ATC TCT GGA AAC TTT CCT CCT CCA CCA CCT CTT 528 
Ser Ala Leu Pro Ser He Ser Gly Asn Phe Pro Pro Pro Pro Pro Leu 
80 85 90 

GAT GAA GAG GCT TTC AAA GTA CAG GGG AAT CCC GGA GGC AAG ACA CTT 576 
10 Asp Glu Glu Ala Phe Lys Val Gin Gly Asn Pro Gly Gly Lys Thr Leu 

95 100 105 110 

GAG GAG AGG CGC TCC AGC CTG GAC GCT GAG ATT GAC TCC TTG ACC AGC 624 
Glu Glu Arg Arg Ser Ser Leu Asp Ala Glu lie Asp Ser Leu Thr Ser 
115 120 125 

15 

ATC TTG GCT GAC CTT GAG TGC AGC TCC CCC TAT AAG CCT CGG CCT CCA 672 
He Leu Ala Asp Leu Glu Cys Ser Ser Pro Tyr Lys Pro Arg Pro Pro 
130 135 140 

CAG AGC TCC ACT GGT TCA ACA GCC TCT CCT CCA GTT TCG ACC CCA GTC 720 
20 Gin Ser Ser Thr Gly Ser Thr Ala Ser Pro Pro Val Ser Thr Pro Val 
145 150 155 

ACA GGA CAC AAG AGA ATG GTC ATC CCG AAC CAA CCC CCT CTA ACA GCA 768 

Thr Gly His Lys Arg Met Val He Pro Asn Gin Pro Pro Leu Thr Ala 

160 165 170 

25 

ACC AAG AAG TCT ACA TTG AAA CCA CAG CCT GCA CCC CAG GCT GGA CCC 816 

Thr Lys Lys Ser Thr Leu Lys Pro Gin Pro Ala Pro Gin Ala Gly Pro 
175 180 185 190 

ATC CCT GTG GCT CCA ATC GGA ACA CTC AAA CCC CAG CCT CAG CCA GTC 864 
30 He Pro Val Ala Pro He Gly Thr Leu Lys Pro Gin Pro Gin Pro Val 

195 200 205 

CCA GCC TCC TAC ACC ACG GCC TCC ACT TCT TCA AGG CCT ACC TTT AAT 912 

Pro Ala Ser Tyr Thr Thr Ala Ser Thr Ser Ser Arg Pro Thr Phe Asn 
210 215 220 

35 

GTG CAG GTG AAG TCA GCC CAG CCC AGC CCT CAT TAT ATG GCT GCC CCT 960 

Val Gin Val Lys Ser Ala Gin Pro Ser Pro His Tyr Met Ala Ala Pro 
225 230 235 

TCA TCA GGA CAA ATT TAT GGC TCA GGG CCC CAG GGC TAT AAC ACT CAG 1008 
40 Ser Ser Gly Gin He Tyr Gly Ser Gly Pro Gin Gly Tyr Asn Thr Gin 

240 245 250 

CCA GTT CCT GTC TCT GGG CAG TGT CCA CCT CCT TCA ACA CGG GGA GGC 1056 
Pro Val Pro Val Ser Gly Gin Cys Pro Pro Pro Ser Thr Arg Gly Gly 
255 260 265 270 

45 

ATG GAT TAT GCC TAC ATT CCA CCA CCA GGA CTT CAG CCG GAG CCT GGG 1104 
Met Asp Tyr Ala Tyr He Pro Pro Pro Gly Leu Gin Pro Glu Pro Gly 
275 280 285 

50 



55 
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TAT GGG TAT GCC CCC AAC CAG GGA CGC TAT TAT GAA GGC TAC TAT GCA 1152 

Tyr Gly Tyr Ala Pro Asn Gin Gly Arg Tyr Tyr Glu Gly Tyr Tyr Ala 
290 295 300 

5 

GCA GGG CCA GGC TAT GGG GGC AGA AAT GAC TCT GAC CCT ACC TAT GGT 1200 

Ala Gly Pro Gly Tyr Gly Gly Arg Asn Asp Ser Asp Pro Thr Tyr Gly 
305 310 315 

CAA CAA GGT CAC CCA AAT ACC TGG AAA CGG GAA CCA GGG TAC ACT CCT 1248 
10 Gin Gin Gly His Pro Asn Thr Trp Lys Arg Glu Pro Gly Tyr Thr Pro 

320 325 330 

CCT GGA GCA GGG AAC CAG AAC CCT CCT GGG ATG TAT CCA GTC ACT GGT 1296 
Pro Gly Ala Gly Asn Gin Asn Pro Pro Gly Met Tyr Pro Val Thr Gly 
335 340 345 350 

CCC AAG AAG ACC TAT ATC ACA GAT CCT GTT TCA GCC CCC TGT GCG CCA 1344 
Pro Lys Lys Thr Tyr lie Thr Asp Pro Val Ser Ala Pro Cys Ala Pro 
355 360 365 

CCA TTG CAG CCA AAG GGT GGC CAT TCA GGG CAA CTG GGG CCT TCG TCA 1392 
20 Pro Leu Gin Pro Lys Gly Gly His Ser Gly Gin Leu Gly Pro Ser Ser 

370 375 380 

GTT GCC CCT TCA TTC CGC CCA GAG GAT GAG CTT GAG CAC CTG ACC AAA 1440 
Val Ala Pro Ser Phe Arg Pro Glu Asp Glu Leu Glu His Leu Thr Lys 
385 390 395 

25 

AAG ATG CTG TAT GAC ATG GAA AAT CCA CCT GCT GAC GAA TAC TTT GGC 1488 
Lys Met Leu Tyr Asp Met Glu Asn Pro Pro Ala Asp Glu Tyr Phe Gly 
400 405 410 

CGC TGT GCT CGC TGT GGA GAA AAC GTA GTT GGG GAA GGT ACA GGA TGC 1536 
30 Arg Cys Ala Arg Cys Gly Glu Asn Val Val Gly Glu Gly Thr Gly Cys 

415 420 425 430 

ACT GCC ATG GAT CAG GTC TTC CAC GTG GAT TGT TTT ACC TGC ATC ATC 1584 
Thr Ala Met Asp Gin Val Phe His Val Asp Cys Phe Thr Cys lie lie 
435 440 445 

TGC AAC AAC AAG CTC CGA GGG CAG CCA TTC TAT GCT GTG GAA AAG AAA 1632 
Cys Asn Asn Lys Leu Arg Gly Gin Pro Phe Tyr Ala Val Glu Lys Lys 
450 455 460 

GCA TAC TGC GAG CCC TGC TAC ATT AAT ACT CTG GAG CAG TGC AAT GTG 1680 
40 Ala Tyr Cys Glu Pro Cys Tyr He Asn Thr Leu Glu Gin Cys Asn Val 

465 470 475 

TGT TCC AAG CCC ATC ATG GAG CGG ATT CTC CGA GCC ACC GGG AAG GCC 1728 
Cys Ser Lys Pro He Met Glu Arg He Leu Arg Ala Thr Gly Lys Ala 
480 485 490 

45 

TAT CAT CCT CAC TGT TTC ACC TGC GTG ATG TGC CAC CGC AGC CTG GAT 1776 
Tyr His Pro His Cys Phe Thr Cys Val Met Cys His Arg Ser Leu Asp 
495 500 505 510 

SO 
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GGG ATC CCA TTC ACT GTG GAT GCT GGC GGG CTC ATT CAC TGC ATT GAG 1824 

Gly lie Pro Phe Thr Val Asp Ala Gly Gly Leu lie His Cys lie Glu 
5 515 520 525 

GAC TTC CAC AAG AAA TTT GCC CCG CGG TGT TCT GTG TGC AAG GAG CCT 1872 
Asp Phe His Lys Lys Phe Ala Pro Arg Cys Ser Val Cys Lys Glu Pro 
530 535 540 

10 ATT ATG CCA GCC CCG GGC CAG GAG GAG ACT GTC CGT ATT GTG GCT TTG 1920 

lie Met Pro Ala Pro Gly Gin Glu Glu Thr Val Arg He Val Ala Leu 
545 550 555 

GAT CGA GAT TTC CAT GTT CAC TGC TAC CGA TGC GAG GAT TGC GGT GGT 1968 
Asp Arg Asp Phe His Val His Cys Tyr Arg Cys Glu Asp Cys Gly Gly 
15 560 565 570 

CTC CTG TCT GAA GGA GAT AAC CAA GGC TGC TAC CCC TTG GAT GGG CAC 2016 
Leu Leu Ser Glu Gly Asp Asn Gin Gly Cys Tyr Pro Leu Asp Gly His 
575 580 585 590 

20 ATC CTC TGC AAG ACC TGC AAC TCT GCC CGC ATC AGG GTG TTG ACC GCC 2064 

He Leu Cys Lys Thr Cys Asn Ser Ala Arg He Arg Val Leu Thr Ala 
595 600 605 



25 



30 



40 



50 



55 



AAG GCG AGC ACT GAC CTT TAGATTCAGT CACCTGTTCA GCCGGCACTG 2112 
Lys Ala Ser Thr Asp Leu 
610 



AGAAGAACGA 


ACACAAGAAA 


AAGATAAGAA 


ATACTAGAGT 


AAAGGCCATC 


AAACTACGCG 


2172 


ATAGTCTCTG 


TTCTTCATCT 


GCTATTAACC 


TTGCCTTAGA 


AACACATAAA 


TTATGAGATT 


2232 


TTTTTTTAAA 


AGTTGTTACC 


AAATACACAT 


TTCACATTGA 


ATCATGTAGG 


ATCTTGATGG 


2292 


GCCTTTGTTC 


CCAAGGACTT 


CCACATTTTT 


GCACAGATTA 


TGCTCCATCC 


CTTCACTTCT 


2352 


GCATTCCTGT 


AACTTTTAAT 


CCCTATGTTT 


GTCTCACTTT 


TCATCTGGTT 


GAATGGCTTT 


2412 


TCTTAGTGTG 


GTATTTGCTG 


TCACATAGTT 


TTTTCCTGGG 


TGAGTCTGCC 


AACTCACAGG 


2472 


TGCTTTTAGG 


CTTGAAATCT 


CCATCCTATC 


ATTTCCGTTT 


TGCCTGTGAC 


TGTAAAGAGT 


2532 


AGCCATTCTT 


TTCCCATGTA 


TTGAAGAGGA 


TATTCTTCTC 


TTGCTTTATA 


CTACTCACGT 


2592 


CCTTGGGGAG 


GGAAATGCAC 


AATTTTTTTT 


TGTTAGGCTG 


TAAAGAATTT 


AAGCTGTAAA 


2652 


TTACATAAGT 


TAGAACAAGC 


CCAAATTTAA 


TTTGCAACCA 


TCAGAATTCA 


GAATCTATAG 


2712 


TGACCAGTGA 


TCAAGGCTAA 


TTGGAAAAGA 


GTTATCGGCC 


CATAGCTAAT 


AAGTAGTGAC 


2772 


AGACAACCAA 


GCTTCAATAT 


TTTTCTAAAG 


AAATTACAGG 


TGGGATATGC 


TAGAAAAGGC 


2832 


ATTTTGGGGT 


TATGTTTAAA 


AAAACATTAT 


TGTCCCACAA 


TATTACCTTA 


AGATTTTTCT 


2892 


TTTCCGCACT 


ACCTGAACAT 


TGTAATACAG 


ACAAACTTGA 


TTTCTTCTAG 


AAGATAACAT 


2952 


TTTCAATACT 


GTCCCACTTC 


TCATCTTAAA 


AATATTGTCA 


TGTTTATTCT 


AATATCCAAC 


3012 
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5 



10 



15 



20 



25 



30 



35 



GCAACTATCA 


AAATTGCCTT 


TTTCTCTAGA 


GGATGAAGGC 


TGTGAAAAAA 


CCGTTCAAAT 


3072 


TCTCTTCTTT 


TTCTTTTTTA 


TTACCAGGTC 


CATTTTGCCT 


GACAATTGCA 


AATCAGAGCA 


3132 


TACAAAATAA 


AACTGTGCAG 


TTTTGTTTGG 


TTTACTTTCA 


AAAGAGTAGA 


AAGCTTGAAA 


3192 


AGATTCTGAA 


ACCACAGTTT 


CATTATTCTC 


ATAATCCTTC 


TGCAACTGAA 


ATTACATATT 


3252 


GCAGGAGACA 


TTTTCATATC 


ATCAATGTGA 


CATTTACACC 


ACACTTTCAA 


AGACAATCAC 


3312 


TGAAACAAAA 


ATTGTCTTTA 


TGAGCTAAAA 


ATATGCAGAA 


TCTCTGCCTA 


GAATCTTTAT 


3372 


TCAAACTTTT 


ATTAGCCAGT 


GAAACACTTG 


CTTGCCAACT 


GCCAAGCCAT 


ACTTATTAAG 


3432 


TTCGAACATG 


TTTCACTTAA 


GGAGAGACAC 


CTAGCTTAGT 


CATGGCAAGT 


TGCCATTTTG 


3492 


TAAACTAAGG 


ATTTTGGACT 


GAGATTTCTT 


AAATCTTTCT 


TCAAATCTCC 


CACAAGTATA 


3552 


TACTTTTAAA 


TTATGGAGTA 


TTTTAAGTCT 


ACAAAAAGGT 


ATAAATAATA 


ATATAATGAA 


3612 


TTCCTATATA 


CCTAATACCC 


AGTTTAAGAC 


ACCAAATATA 


ACAAGTATAA 


TTACATCCTC 


3672 


CAATGTACCG 


TTTCCTTATT 


CCACAGATAT 


CTTTTTCATT- 


ATTGTGAAGT 


GATGTTCAGA 


3732 


TTTCTAGTTT 


TTTTTTCTAG 


TTTTTAATTT 


TAACATCAGA 


ACTGAAATAA 


AAAATTATGG 


3792 


ATACGTGTTT 


TGAATTGCAA 


ACTATTCCTC 


AGGAATTCCA 


ATTAAATTTA 


TTTTACTTGA 


3852 


ATAGGAATGA 


TCATAAAAGT 


GATTCTTTTT 


TTGTGACTAG 


AAATTCTTAA 


GCCGATGGTC 


3912 


ACTATAGCTC 


ATCCTTAATG 


TATGGCTCAT 


TTGCTTTTGT 


CACTAAACGG 


TTTTGTGTTA 


3972 


GAACCACCAA 


AATTATAGCT 


TTTAAGAGCT 


TCCTTTGACC 


ACTGTCTTTT 


TCTTACCCTA 


4032 


CTTCTCTTAT 


CTTTGATCGT 


ATATTTCTCA 


TAATGTGAAA 


TATGATGAGA 


TTCACTTAGG 


4092 


GGCAGCATGT 


TAGTTTTGGG 


AGGCAATGTC 


AACTGTGTCT 


CTGAATTCCT 


GTCTTCCAAA 


4152 


TTGAAGCCAG 


ACCATGCTGA 


TGACCTCAAG 


TAGCACTGAC 


TATTTGACAA 


TAGGGCTGAT 


4212 


AATGTAATCG 


GCTTGAATTT 


TGACTTAGTA 


ACTTTTTATG 


TAATACTTTC 


GGAGAAATTC 


4272 


TCTTTAGGAC 


AAAGCAGAGA 


GTCCAATTTA 


TTGAGGGATA 


GATTGTATCT 


C 


4323 



40 



45 



50 
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INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 612 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(F) TISSUE TYPE: small intestine 

(ix) FEATURE: 

(A) NAME/ KEY: Domain 

(B) LOCATION: 413.. 470 

(D) OTHER INFORMATION: /label- LIM1 

/note- "first LIM domain of LPP gene product" 

(ix) FEATURE: 

(A) NAME/KEY: Domain 

(B) LOCATION: 474.. 528 

(D) OTHER INFORMATION: /label" LIM2 

/note= "second LIM domain of LPP gene product" 

(ix) FEATURE: 

(A) NAME/KEY: Domain 

(B) LOCATION: 533.. 596 

(D) OTHER INFORMATION: /label= LIM3 

/note* "third LIM domain of LPP gene product" 

(ix) FEATURE: 

(A) NAME/KEY: Cleavage-site 

(B) LOCATION: 370.. 371 

(D) OTHER INFORMATION: /label= BREAK-POINT 

/note= "breaking-point within the LPP gene" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Ser His Pro Ser Trp Leu Pro Pro Lys Ser Thr Gly Glu Pro Leu 
1 5 10 15 

Gly His Val Pro Ala Arg Met Glu Thr Thr His Ser Phe Gly Asn Pro 
20 25 30 

Ser He Ser Val Ser Thr Gin Gin Pro Pro Lys Lys Phe Ala Pro Val 
35 40 45 

Val Ala Pro Lys Pro Lys Tyr Asn Pro Tyr Lys Gin Pro Gly Gly Glu 
50 55 60 

Gly Asp Phe Leu Pro Pro Pro Pro Pro Pro Leu Asp Asp Ser Ser Ala 
65 70 75 80 
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Leu Pr Ser lie Ser Gly Asn Phe Pro Pro Pro Pro Pro Leu Asp Glu 
85 90 95 

Glu Ala Phe Lys Val Gin Gly Asn Pro Gly Gly Lys Thr Leu Glu Glu 
100 105 110 



10 



Arg Arg Ser Ser Leu Asp Ala Glu lie Asp Ser Leu Thr Ser He Leu 
115 120 125 

Ala Asp Leu Glu Cys Ser Ser Pro Tyr Lys Pro Arg Pro Pro Gin Ser 
130 135 140 



15 



SO 



25 



30 



Ser Thr Gly Ser Thr Ala Ser Pro Pro Val Ser Thr Pro Val Thr Gly 
145 150 155 160 

His Lys Arg Met Val lie Pro Asn Gin Pro Pro Leu Thr Ala Thr Lys 
165 170 175 

Lys ser Thr Leu Lys Pro Gin Pro Ala Pro Gin Ala Gly Pro lie Pro 
180 185 190 

Val Ala Pro lie Gly Thr Leu Lys Pro Gin Pro Gin Pro Val Pro Ala 
195 200 205 

Ser Tyr Thr Thr Ala Ser Thr Ser Ser Arg Pro Thr Phe Asn Val Gin 
210 215 220 

Val Lys Ser Ala Gin Pro Ser Pro His Tyr Met Ala Ala Pro Ser Ser 
225 230 235 240 

Gly Gin lie Tyr Gly Ser Gly Pro Gin Gly Tyr Asn Thr Gin Pro Val 
245 250 255 



35 



Pro Val Ser Gly Gin Cys Pro Pro Pro Ser Thr Arg Gly Gly Met Asp 
260 265 270 

Tyr Ala Tyr lie Pro Pro Pro Gly Leu Gin Pro Glu Pro Gly Tyr Gly 

275 280 285 



40 



45 



Tyr Ala Pro Asn Gin Gly Arg Tyr Tyr Glu Gly Tyr Tyr Ala Ala Gly 
290 295 300 

Pro Gly Tyr Gly Gly Arg Asn Asp Ser Asp Pro Thr Tyr Gly Gin Gin 
305 310 315 320 

Gly His Pro Asn Thr Trp Lys Arg Glu Pro Gly Tyr Thr Pro Pro Gly 
325 330 335 

Ala Gly Asn Gin Asn Pro Pro Gly Met Tyr Pro Val Thr Gly Pro Lys 
340 345 350 



50 



Lys Thr Tyr lie Thr Asp Pro Val Ser Ala Pro Cys Ala Pro Pro Leu 
355 360 365 

Gin Pro Lys Gly Gly His Ser Gly Gin Leu Gly Pro Ser Ser Val Ala 
370 375 380 
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w 



15 



20 



25 



30 



35 



40 



Pro Ser Phe Arg Pro Glu Asp Glu Leu Glu His Leu Thr Lys Lys Met 
385 390 395 400 

Leu Tyr Asp Met Glu Asn Pro Pro Ala Asp Glu Tyr Phe Gly Arg Cys 
405 410 415 

Ala Arg Cys Gly Glu Asn Val Val Gly Glu Gly Thr Gly Cys Thr Ala 
420 425 430 

Met Asp Gin Val Phe His Val Asp Cys Phe Thr Cys lie lie Cys Asn 
435 440 445 

Asn Lys Leu Arg Gly Gin Pro Phe Tyr Ala Val Glu Lys Lys Ala Tyr 
450 455 460 

Cys Glu Pro Cys Tyr lie Asn Thr Leu Glu Gin Cys Asn Val Cys Ser 
465 470 475 480 

Lys Pro lie Met Glu Arg lie Leu Arg Ala Thr Gly Lys Ala Tyr His 
485 490 495 

Pro His Cys Phe Thr Cys Val Met Cys His Arg Ser Leu Asp Gly lie 
500 505 510 

Pro Phe Thr Val Asp Ala Gly Gly Leu lie His Cys lie Glu Asp Phe 
515 520 525 

His Lys Lys Phe Ala Pro Arg Cys Ser Val Cys Lys Glu Pro lie Met 
530 535 540 

Pro Ala Pro Gly Gin Glu Glu Thr Val Arg lie Val Ala Leu Asp Arg 
545 550 555 560 

Asp Phe His Val His Cys Tyr Arg Cys Glu Asp Cys Gly Gly Leu Leu 
565 570 575 

Ser Glu Gly Asp Asn Gin Gly Cys Tyr Pro Leu Asp Gly His lie Leu 
580 585 590 

Cys Lys Thr Cys Asn Ser Ala Arg lie Arg Val Leu Thr Ala Lys Ala 
595 600 605 



Ser Thr 
610 



Asp Leu 



45 



SO 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4067 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: YES 

{iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
is (A) ORGANISM: Homo sapiens 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT: 12 

(B) MAP POSITION: 12ql5 

20 (ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 812.. 1141 

(D) OTHER INFORMATION: / codon_S tart- 812 
/product* "HMGI-C" 
/label= HMGI-C 

25 



30 



35 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 








CTTGAATCTT 


GGGGCAGGAA 


CTCAGAAAAC 


TTCCAGCCCG 


GGCAGCGCGC 


GCTTGGTGCA 


60 


AGACTCAGGA 


GCTAGCAGCC 


CGTCCCCCTC 


CGACTCTCCG 


GTGCCGCCGC 


TGCCTGCTCC 


120 


CGCCACCCTA 


GGAGGCGCGG 


TGCCACCCAC 


TACTCTGTCC 


TCTGCCTGTG 


CTCCGTGCCC 


180 


GACCCTATCC 


CGGCGGAGTC 


TCCCCATCCT 


CCTTTGCTTT 


CCGACTGCCC 


AAGGCACTTT 


240 


CAATCTCAAT 


CTCTTCTCTC 


TCTCTCTCTC 


TCTCTCTGTC 


TCTCTCTCTC 


TCTCTCTCTC 


300 


TCTCTCTCTC 


GCAGGGTGGG 


GGGAAGAGGA 


GGAGGAATTC 


TTTCCCCGCC 


TAACATTTCA 


360 


AGGGACACAA 


TTCACTCCAA 


GTCTCTTCCC 


TTTCCAAGCC 


GCTTCCGAAG 


TGCTCCCGGT 


420 


GCCCGCAACT 


CCTGATCCCA 


ACCCGCGAGA 


GGAGCCTCTG 


CGACCTCAAA 


GCCTCTCTTC 


480 


CTTCTCCCTC 


GCTTCCCTCC 


TCCTCTTGCT 


ACCTCCACCT 


CCACCGCCAC 


CTCCACCTCC 


540 


GGCACCCACC 


CACCGCCGCC 


GCCGCCACCG 


GCAGCGCCTC 


CTCCTCTCCT 


CCTCCTCCTC 


600 


CCCTCTTCTC 


TTTTTGGCAG 


CCGCTGGACG 


TCCGGTGTTG 


ATGGTGGCAG 


CGGCGGCAGC 


660 


CTAAGCAACA 


GCAGCCCTCG 


CAGCCCGCCA 


GCTCGCGCTC 


GCCCCGCCGG 


CGTCCCCAGC 


720 


CCTATCACCT 


CATCTCCCGA 


AAGGTGCTGG 


GCAGCTCCGG 


GGCGGTCGAG 


GCGAAGCGGC 


780 
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TGCAGCGGCG GTAGCGGCGG CGGGAGGCAG G ATG AGC GCA CGC GGT GAG GGC 832 

Met Ser Ala Arg Gly Glu Gly 
1 5 

GCG GGG CAG CCG TCC ACT TCA GCC CAG GGA CAA CCT GCC GCC CCA GCG 880 
Ala Gly Gin Pro Ser Thr Ser Ala Gin Gly Gin Pro Ala Ala Pro Ala 
10 15 20 

CCT CAG AAG AGA GGA CGC GGC CGC CCC AGG AAG CAG CAG CAA GAA CCA 928 
Pro Gin Lys Arg Gly Arg Gly Arg Pro Arg Lys Gin Gin Gin Glu Pro 
25 30 35 

ACC GGT GAG CCC TCT CCT AAG AGA CCC AGG GGA AGA CCC AAA GGC AGC 976 
Thr Gly Glu Pro Ser Pro Lys Arg Pro Arg Gly Arg Pro Lys Gly Ser 
40 45 50 55 

AAA AAC AAG AGT CCC TCT AAA GCA GCT CAA AAG AAA GCA GAA GCC ACT 1024 
Lys Asn Lys Ser Pro Ser Lys Ala Ala Gin Lys Lys Ala Glu Ala Thr 
60 65 70 

GGA GAA AAA CGG CCA AGA GGC AGA CCT AGG AAA TGG CCA CAA CAA GTT 1072 
Gly Glu Lys Arg Pro Arg Gly Arg Pro Arg Lys Trp Pro Gin Gin Val 
75 80 85 

GTT CAG AAG AAG CCT GCT CAG GAG GAA ACT GAA GAG ACA TCC TCA CAA 1120 
Val Gin Lys Lys Pro Ala Gin Glu Glu Thr Glu Glu Thr Ser Ser Gin 
90 95 100 



GAG TCT GCC GAA GAG GAC TAGGGGGCGC AACGTTCGAT TTCTACCTCA 1168 
Glu Ser Ala Glu Glu Asp 





105 




110 










30 


GCAGCAGTTG 


GATCTTTTGA 


AGGGAGAAGA 


CACTGCAGTG 


ACCACTTATT 


CTGTATTGCC 


1228 




ATGGTCTTTC 


CACTTTCATC 


TGGGGTGGGG 


TGGGGTGGGG 


TGGGGGAGGG 


GGGGGTGGGG 


1288 




TGGGGAGAAA 


TCACATAACC 


TTAAAAAGGA 


CTATATTAAT 


CACCTTCTTT 


GTAATCCCTT 


1348 


35 


CACAGTCCCA 


GGTTTAGTGA 


AAAACTGCTG 


TAAACACAGG 


GGACACAGCT 


TAACAATGCA 


1408 




ACTTTTAATT 


ACTGTTTTCT 


TTTTTCTTAA 


CCTACTAATA 


GTTTGTTGAT 


CTGATAAGCA 


1468 




AGAGTGGGCG 


GGTGAGAAAA 


ACCGAATTGG 


GTTTAGTCAA 


TCACTGCACT 


GCATGCAAAC 


1528 


40 


AAGAAACGTG 


TCACACTTGT 


GACGTCGGGC 


ATTCATATAG 


GAAGAACGCG 


GTGTGTAACA 


1588 




CTGTGTACAC 


CTCAAATACC 


ACCCCAACCC 


ACTCCCTGTA 


GTGAATCCTC 


TGTTTAGAAC 


1648 




ACCAAAGATA 


AGGACTAGAT 


ACTACTTTCT 


CTTTTTCGTA 


TAATCTTGTA 


GACACTTACT 


1708 


45 


TGATGATTTT 


TAACTTTTTA 


TTTCTAAATG 


AGACGAAATG 


CTGATGTATC 


CTTTCATTCA 


1768 




GCTAACAAAC 


TAGAAAAGGT 


TATGTTCATT 


TTTCAAAAAG 


GGAAGTAAGC 


AAACAAATAT 


1828 




TGCCAACTCT 


TCTATTTATG 


GATATCACAC 


ATATCAGCAG 


GAGTAATAAA 


TTTACTCACA 


1888 


50 


GCACTTGTTT 


TCAGGACAAC 


ACTTCATTTT 


CAGGAAATCT 


ACTTCCTACA 


GAGCCAAAAT 


1948 
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GCCATTTAGC AATAAATAAC ACTTGTCAGC 
AAATTATCCT CTTTGTAATT TAATGAAAAG 

5 

CTAATTATGA GGTGGGAGGA GCGAAATCTA 
TAAAAAGCAA AAAAAAAAAG GGGGGGGCAA 
CCTCTCCCTC TCTCTTTTCA TGTGTATCAG 

10 

TCAAATTAAG CATATGTGTT ACTTCAAGTA 
GTGCTTTTCT TCGGCTTGAG TTCACCATCT 
TGCAATATAT CCCCACTACT CAATACTACC 

15 

TACTTATTAC ATGCTGCTAT ACACAAGCAA 
TCTAATCATC TGCAGTTCTT TTTGTACACT 
TGTGTTTCAG CATGACTATG TATTTTTCTA 

20 

TTGTTTCAGC TTCTCTGCTA GATTTCTACA 
CCTAGGTTCT TAAGGATAAT TTTCCTCAAT 
AATATTTAAA TATTACCCTC CAAGTCTGTA 

25 

AATTGACTTG CAAAGACCTA CCTCCAGACT 
TTCATTTGTT TTTTCAATGT TTGAAATAGT 
TTTTTGTAAA AGACATTTGA TAGAAAGGAA 

30 

AAATAATAAA TAAAATAAAG CCAACCTTCA 
ACAAGCCCTG CTTTTGCATA ATGCAATCAA 

35 AAGAAAATGC TTGACAAATA TTTTCATGTA 

CTCAACCAGA TTTATTTTAA ACGCTTCTTA 
GAGTGTGCTG ACTTTTTAAC ATGGTATTAT 

. n GGCTTTTGTC AGTATGGCTT TTAGTACTGA 

CCAGCTGCTT CAGGGAGGTA GTTTCAAAGG 
TCACTGTTGT GAATCACCAA AGGAGCTATG 
CTGTGCGTTA AATAAGCAAA TAAACAGTGG 
TTTGGATGGG CCTTTTAGAA ACCTCATTGG 
GTTGGCCAAA CATGGTGCAC CGAGTGATTT 

_ CTGTATGTTG TACAATCAAA ACACACTACT 



CTCAGAGCAT 


TTAAGGAAAC 


TAGACAAGTA 


2008 


GTACAACAGA 


ATAATGCATG 


ATGAACTCAC 


2068 


AATTTCTTTT 


GCTATAGTTA 


TACATCAATT 


2128 


TCTCTCTCTG 


TGTCTTTCTC 


TCTCTCTCTC 


2188 


TTTCCATGAA 


AGACCTGAAT 


ACCACTTACC 


2248 


ATACGTTTTG 


ACATAAGATG 


GTTGACCAAG 


2308 


CTTCATTCAA 


ACTGCACTTT 


TAGCCAGAGA 


2368 


TCTGAATGTT 


ACAACGAATT 


TACAGTCTAG 


2428 


TGCAAGAAAA 


AAACTTACTG 


GGTAGGTGAT 


2488 


TAATTACAGT 


TAAAGAAGCA 


ATCTCCTTAC 


2548 


TGTTTTTTTA 


ATTAAAAATT 


TTTAAAATAC 


2608 


TTAACTTGAA 


AATTTTTTAA 


CCAAGTCGCT 


2668 


CACACTACAC 


ATCACACAAG 


ATTTGACTGT 


2728 


CCTCAAATGA 


ATTCTTTAAG 


GAGATGGACT 


2788 


TCAAAAGGAA 


TGAACTTGTT 


ACTTGCAGCA 


2848 


TCAAACTGCA 


GCTAACCCTA 


GTCAAAACTA 


2908 


CACGTTTTTA 


CATACTTTTG 


CAAAATAAGT 


2968 


AAGAACTTGA 


AGCTTTGTAG 


GTGAGATGCA 


3028 


AAATATGTGT 


TTTTAAGATT 


AGTTGAATAT 


3088 


TTTTACACAA 


ATGTGATTTT 


TGTAATATGT 


3148 


TGTAGAGTTT 


TTATGCCTTT 


CTCT C CT AGT 


3208 


CAACTGGGCC 


AGGAGGTAGT 


TTCTCATGAC 


3268 


AGCCAAATGA 


AACTCAAAAC 


CATCTCTCTT 


3328 


CCACATACCT 


CTCTGAGACT 


GGCAGATCGC 


3388 


GAGAGAATTA 


AAACTCAACA 


TTACTGTTAA 


3448 


CTCATAAAAA 


TAAAAGTCGC 


ATTCCATATC 


3508 


CCAGCTCATA 


AAATGGAAGC 


AATTGCTCAT 


3568 


CCATCTCTGG 


TAAAGTTACA 


CTTTTATTTC 


3628 


ACCTCTTAAG 


TCCCAGTATA 


CCTCATTTTT 


3688 
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CATACTGAAA 


AAAAAAGCTT 


GTGGCCAATG 


GAACAGTAAG 


AACATCATAA 
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Claims 

£0 1 . Multi-tumor Aberrant Growth (MAO) gene having the nucleotide sequence of any one of the strands of any one of 
the members of the High Mobility Group protein genes or LIM protein genes, including modified versions thereof. 

2. Multi-tumor Aberrant Growth (MAG) gene as claimed in claim 1 having essentially the nucleotide sequence of the 
HMGI-C gene as depicted in figure 7, or the complementary strand thereof, including modified or elongated ver- 

25 sions of both strands. 

3. Multi-tumor Aberrant Growth (MAG) gene as claimed in claim 1 having essentially the nucleotide sequence of the 
LPP gene as depicted in figure 5, or the complementary strand thereof, including modified or elongated versions of 
both strands. 

30 

4. Multi-tumor Aberrant Growth (MAG) gene as claimed in claim 1 , 2 or 3 for use as a starting point for designing suit- 
able expression-modulating compounds or techniques for the treatment of non-physiological proliferation phenom- 
ena in human or animal. 

35 5. Multi-tumor Aberrant Growth (MAG) gene as claimed in claim 1 , 2 or 3 for use as a starting point for designing suit- 
able nucleotide probes for (clinically/medically) diagnosing cells having a non-physiological proliferative capacity as 
compared to wildtype cells. 

6. Protein encoded by the Multi-tumor Aberrant Growth (MAG) gene as claimed in claim 1 , 2 or 3 for use as a starting 
40 point for preparing suitable antibodies for (clinically/medically) diagnosing cells having a non-physiological prolifer- 
ative capacity as compared to wildtype cells. 

7. Derivatives of the MAG gene as claimed in claim 1 , 2 or 3 for use in diagnosis and the preparation of therapeutical 
compositions, wherein the derivatives are selected from the group consisting of sense and anti-sense cDNA or 

45 fragments thereof, transcripts of the gene or practically usable fragments thereof, fragments of the gene or its com- 
plementary strand, proteins encoded by the gene or fragments thereof, antibodies directed to the gene, the cDNA, 
the transcript, the protein or the fragments thereof, as well as antibody fragments. 

8. In situ diagnostic method for diagnosing cells having a non-physiological proliferative capacity, comprising at least 
so some of the following steps: 

a) designing a set of nucleotide probes based on the information obtainable from the nucleotide sequence of 
the MAG gene as claimed in claim 1 or 2, wherein one of the probes is hybridisable to a region of the aberrant 
gene substantially mapping at the same locus as a corresponding region of the wildtype gene and the other 

55 probe is hybridisable to a region of the aberrant gene mapping at a different locus than a corresponding region 

of the wildtype gene; 

b) incubating one or more interphase or metaphase chromosomes or cells having a non-physiological prolifer- 
ative capacity, with the probe under hybridising conditions; and 

c) visualising the hybridisation between the probe and th gene. 
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9. Method of diagnosing celts having a non-physiological proliferative capacity, comprising at least some of the follow- 
ing steps: 

a) taking a biopsy of the cells to be diagnosed; 
s b) isolating a suitable MAG gene-related macromolecule therefrom; 

c) analysing the macromolecule thus obtained by comparison with a wildtype reference molecule preferably 
from the same individual. 

1 0. Method as claimed in claim 9, comprising at least some of the following steps: 

w 

a) taking a biopsy of the cells to be diagnosed; 

b) extracting total RNA thereof; 

c) preparing at least one first strand CON A of the mRNA species in the total RNA extract, which cDNA com- 
prises a suitable tail; 

is d) performing a PCR and/or RT-PCR using a MAG gene specific primer and a tail-specific and/or partner- 

specif/nested primer in order to amplify MAG gene specific cDNA's; 

e) separating the PCR products on a gel to obtain a pattern of bands; 

f) evaluating the presence of aberrant bands by comparison to wildtype bands, preferably originating from the 
same individual. 

20 

1 1 . Method as claimed in claim 9, comprising at least some of the following steps: 

a) taking a biopsy of the cells to be diagnosed; 

b) isolating total protein therefrom; 

25 c) separating the total protein on a gel to obtain essentially individual bands and optionally trnasferring the 

bands to a Western blot; 

d) hybridising the bands thus obtained with antibodies directed against a part of the protein encoded by the 
remaining part of the MAG gene and against a part of the protein encoded by the substitution part of the MAG 
gene; 

30 e) visualising the antigen-antibody reactions and establishing the presence of aberrant bands by comparison 

with bands from wildtype proteins, preferably originating from the same individual. 

1 2. Method as claimed in claim 9. comprising at least some of the following steps: 

35 a) taking a biopsy of the cells to be diagnosed; 

b) isolating total DNA therefrom; 

c) digesting the DNA with one or more so-called "rare cutter" restriction enzymes; 

d) separating the digest thus prepared on a gel to obtain a separation pattern; 

e) optionally transfering the separation pattern to a Southern blot; 

40 f) hybridising the separation pattern in the gel or on the blot with one or more informative probes under hybrid- 

ising conditions; 

g) visualising the hybridisations and establishing the presence of aberrant bands by comparison to wildtype 
bands, preferably originating from the same individual. 

45 13. Method as claimed in any one of the claims 8-12, wherein the cells having a non-physiological proliferative capacity 
are selected from the group consisting of the mesenchymal tumors hamartomas (e.g. breast and lung), lipomas, 
pleomorphic salivary gland adenomas, uterine leiomyomas, angiomyxomas, fibroadenomas of the breast, polyps 
of the endometrium, atherosclerotic plaques, and other benign tumors as well as various malignant tumors, includ- 
ing but not limited to sarcomas (e.g. rhabdomyosarcoma, osteosarcoma) and carcinomas (e.g. of breast lung, skin, 

so thyroid). 

14. Anti-sense molecules of a MAG gene as claimed in claim 1 , 2 or 3 for use in the treatment of diseases involving 
cells having a non-physiological proliferative capacity by modulating the expression of the gene. 

55 15. Expression inhibitors of the MAG gene as claimed in claim 1 , 2 or 3 for use in the treatment of diseases involving 
cells having a non-physiological proliferative capacity. 

16. Diagnostic kit for performing the method as claimed in claim 8, comprising a suitable set of labeled nucleotide 
probes. 
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17. Diagnostic kit for performing the method as claimed in claim 10, comprising a suitable set of labeled probes. 

18. Diagnostic kit for performing the method as claimed in claim 11, comprising a suitable set of lab led MAG gene 
specific and tail specific PGR primers. 

19. Diagnostic kit for performing the method as claimed in claim 1 1 , comprising a suitable set of labeled probes, and 
suitable rare cutting restriction enzymes. 

20. Method for isolating other MAG genes based on the existence of a fusion gene, fusion transcript or fusion protein 
in a tumor cell by using at least a part of a MAG gene for designing molecular tools (probes, primers etc.). 

21. MAG genes obtainable by the method of claim 20. 

22. MAG genes as claimed in claim 21 for use in diagnostic or therapeutic methods. 
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Fig. 1-1 
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Fig. 1-2 




s 

K 
Sf 



322 ca 



MKN 

III 



MAR 

ULCH12 



•S4D2 



345A12 
• 2Q11 



4saet 



•03C2 



-I 41 A2 

—7 341 CI 



M4F1 



N 
SI 

_U 



•3»H2 



I I 



1 



- 208(312 



253H1 



MAR 



- 322C8 



datoM r*glon In YAC 303F11 



»3*H2 
•21 B» 



P R 

t I 



S1P8 



303F11 



MAR 



253H1 



320F4 



242A2 
27SH4 



sr 



%99H2 — f> 
t2lBt ► 



244812 



164 A3 



MAR 



51 FS 



320F9 



I3IH2 
921 B» 



27SH4 



.244812 



146H7 - 
142F4 



74B11 - 
404612 



24163 



63 



EP 0 727 487 A1 



Fig. 1-3 
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Fig. 1-4 
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Fig. 4 



CG CTT CAG AAG AGA 
L Q K a 




GGA d 
G R 



GGC CGC CCC AGG AAG CAG CAG CAA A 
GRPRKQQQK 



45 



AA CCA ACC GGT GAG CCC TCT CCT AAG AGA CCC AGG GGA AGA CCC A 
PTGEPSPKRPRGRPK 



90 



AA GGC AGC AAA AAC AAG AGT CCC TCT AAA GCA GCT CAA GAG GAA G 
GSJCNKSPSKAAQEEA 



135 




225 



180 



GGHSGQLGBSSVAPS 

CA TTC CGC CCA GAG GAT GAG CTT GAG CAC CTG ACC AAA AAG ATG C 270 
FRPSOSLSHCTXKMli 

TG TAT GAC ATG GAA AAT CCA CCT GCT GAC GAA TAC TTT GGC CGC T 315 
YDMEffP~PADEYFGRC 

GT GCT CGC TGT GGA GAA AAC GTA GTT GGG GAA GCT ACA GGA TGC A 360 
ARCGENVV GEGTGCT 

CT GCC ATG GAT CAG GTC TTC CAC GTG GAT TGT TTT ACC TGC ATC A 405 
AMDQVFHVD CFTCII 

TC TGC AAC AAC AAG CTC CGA GGG CAG CCA TTC TAT GCT GTG GAA A 450 
C MNKLRGQP F Y A V E K 

AG AAA GCA TAC TGC GAG CCC TGC TAC ATT AAT ACT CTG GAG CAG T 495 
KAYCEPCYI tfTLSQC 

GC AAT GTG TGT TCC AAG CCC ATC ATG GAG CGG ATT CTC CGA GCC A 540 
NVCSKPXMERXLRAT 

CC GGG AAG GCC TAT CAT CCT CAC TGT TTC ACC TGC GTG ATG TGC C 585 
GKAYHPHC FTCVMCH 

AC CGC AGC CTG GAT GGG ATC CCA TTC ACT GTG GAT GCT GGC GGG C 630 
R S I*DGXPFTVDAGGX» 

TC ATT CAC TGC ATT GAG GAC TTC CAC AAG AAA TTT GCC CCG CGG T 675 
X HC IEDFKKKFAPRC 

GT TCT* GTG TGC AAG GAG CCT ATT ATG CCA GCC CCG GGC CAG GAG G 720 
SVCKEPIMPAPGQEE 

AG ACT GTC CGT ATT GTG GCT TTG GAT CGA GAT TTC CAT GTT CAC T 765 
TVRIVALDRDFHVHC 

GC TAC CGA TGC GAG GAT TGC GGT GGT CTC CTG TCT GAA GGA GAT A 810 
YRCEDCGG I* I* S E G D fcf 

AC CAA GGC TGC TAC CCC TTG GAT GGG CAC ATC CTC TGC AAG ACC T * 855 
QGCYPLDGHILCKTC 

GC AAC TCT GCC CGC ATC AGG GTG TTG ACC GCC AAG GCG AGC ACT G 900 
NSARIRVLTAKASTD 

AC CTT TAG ATT CAG TCA CCT GTT CAG CCG GCA CTG AGA AGA ACG A 945 
L * XQSPVQ PAI»RRTN 

AC ACA AGA AAA AGA TAA GAA ATA CPA GAG TAA AGG CCA TCA AAC T 990 
TRKR + EXLE + RPSHY 

AC GCG AAA AAA AAA AAA AAA AAA AAA GAT GTC GAC GGA TCC TT 1033 
AKKKKXKKDVDGS* 



68 



EP0 727 487A1 



Fig. 5-t 
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Fig. 5-2 
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Fig. 5-3 

TAGAATCTTT ATTCAAACTT Xl'ATTAGCCA GTGAAACACr TOCTTGCCAA CTGCCAAGCC 3420 
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TATTTTACTT GAATAGGAAT GATCATAAAA GTGATTCTTT TTTTGTGACT AGAAATTCTT 3900 

AAGCCGAXGG TCACTATAGC TCAICCTTAA TGTATGGCTC ATTTGCITTT GTCACTAAAC 3960 
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AATAGGGCTG ATAATGTAAT CGGCJJTGAAT TTTCiACTTAG TAAC1TTTTA TGTAATACTT 4260 

TCGGAGAAAT TCTCTTTAGG ACAAAGCAGA GAGTCCAAXT TATTGAGGGA TAGATTGTAT 4320 

CTC 4323 
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Fig. 6 
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Fig- 7-1 



BASE COUNT 1119 ft *BS C 791 g 1169 t 

ORIGIN 

1 cttgaatctt ggggcaggaa ctcagaaaae ttccagcocg ggcagcgoge gcttggtgoa 
61 agaotoaggii yctagoagee egtooccctc egaatctccg gtgoogccgc tgcctgotoo 
121 ogaoAoerot* ggaggagogg tgooacccac laetotgtcc tctgcatgtg ctecgtgoeo 
101 gaooctatcc oggeggagtc tocceatcct cotttgcttt ccgaetgeee aaggoaettt 
241 a«»totcaat etettotcto tctctctctc tctetetgtc tctctctoto tctctctoto 
301 tctutwtcto gcagggtggg gygaagagga ggaggaattc tttccccgcc taaoatttca 
361 AgggauAum* ttoaotocaa gtotcttccc tLtgoaagcc gcttoogaag tgctoooggt 
421 gooogeaaci eetgatccca accggogaga ggagcotctg cgacctoaaa gectctctto 
461 ottctccctc gottceetcc tootcttgct acoteeaeet ccaccgeeac ctooaoctce 
541 ggeacceaoo oaccgcogoc gccgocaccg gcagcgeetc ctcctotcct cc toot cat e 
601 ccctcttctc tttttggcag ccgctggacg tceggtgttg atggtggcag oggoggcago 
661 ctaagcaaoa geagccctcg cagcecgoca gctcgcgetc gccocgcogg cgtooooagc 
721 eetatuaoot oatetaecga aaggtgctgg geagotcegg ggcggtegag gc ga agcgg o 
761 tgcagoggcg gtaguggcgg egygaggcag gatgagogca cgcggtgagg gggegggyc* 
841 gcogtccact tcagoccagg gauaaoctgc cgcecoagcg eeteagaaga gaggac gcgg 
" 901 qegooaoagg aayeagoagc aagaacoaac eggtgagooo tctcetnaga gacccagggg 
961 aagacccaaa ggoagcaaa* aeaagagUc qtctaaagca gotcaaaaga aagcagaago 
1031 oaotggagaa aaacggceaa gaggeagaco taggaaatgg ooa c aa c aa g ttgttcagaa 
1061 gaagoetgct oaggaggaaa etgaagagac atootoacaa gagtotgccg aagaggacta 
1141 gggggogeaa egttcgaLtt ctacoteage aggagttgga tottttgaag ggag aagac a 
1201 otgeagtgac cAcLUttct gtattgeeat ggtctttooa ctttoatotg gggtggggtg 
1261 gggtggggtg gygyaggggg gggtggggtg gggagaaatc aeataaoott aaaaaggaet 
1121 Atattaaioa gpucLttgt aatcccttca cagtccoagg tttagtgaaa aactgctgta 
1361 aaoaeagggg acacagctta aeaatgcaae ttttaattae tgttttcttt tttcttaacc 
1441 taotaatagt ttgttyatct gataageaag agtgggcggg tgagaaaaao egaattgggt 
1501 ttagtoaato actvcaotgc atgcaaacaa gaaacgtgtc aeaettgtga cgtcgggoat 
1561 tcatatagga agaaegoggt gtgtaaoaet gtgtaoaoot caaataooac coc»aeecac 
1621 tccotgtagf. gaatcetcty tttagaacao caaagataag yactagatac tactttetct 
1631 ttttogtata atcttgtaya cacttacttg atgattttta actttttatt tctaaatgag 
1741 acgaaatgct gatgtatcot ttcattcago taaeaaaeta gaaaaggtta tgttoatttt 
1rt01 tcaaaaaggg aagtaagcaa auaaatattg ocaaotcttc tatttatgga tatcacaoat 
18C1 atcageagga gta*taaatt taotoaoagc acttgtttte aggacaaoac ttoattttca 
1921 ggaaatctae ttcctaeaga gcoaaaatge catttaggaa taaataacac ttgteaggct 
1981 oagageattt aaggaaaota gaoaagtaaa attatcctgt ttgtaattta atgaaaaggt 
2041 aoaaoaoaat aatgeatgat gaactoacet aattatgagg tgggaggago gaaatotaaa 
2101 tttcttttgo tatagtt*i.a oatcaattta aaaageaaaa aaaaaaaggg gggggcaato 
2161 tntctctgtg totttotetc tetctotoog tctccctctc tcttttcatg tgtateagtt 
2321 trsr^tgaaag acctgaata© eacttaootc aaattaagca tatgtgttag ttcaagtaat 
2281 aegttttyac ataagatggt tgaetaiaggt gcttttotto qycttgagtt caccatotct 
2341 tcatteaaao tgcactttta gccagagatg caatatatcc ccactactca atactacotc 
2401 tgaatgttac aaegaattta eagtctagta cttattacat ggtgctatae aeaagcaatg 
2461 oaagaaaaaa acttactggg taggtgattc taatcatatg cagttctttt tgtacactta 
2521 attaoagtta aagaageaat cteettactg tgtttoagca tgactatgta tttttctatg 
2581 Lttttttaat taaaaatttt taaaatactt gtttcagctt otctgotaga tttctacatt 
2641 aaottgaaaa ttttttaaec aagtcgctcc taggttctta aggataattt tcetcaatca 
27m caetacaoat oaoacaagat ttgactgtaa tatttaaata ttaccctcca agtctgtacc 
2761 toaaatgaat totttaagga yatggaetaa ttgacttgca aagaootacc tceagactto 
2821 «taaaggaatg aaettgttac ttgcagcatt catttgtttt ttcaatgttt gaaatagtto 
2881 aaaotgeage t»agcctagt eaaaactatt tttgtaaaag aeatttgata gaaaggaaca 
2941 cgtttttaoa taottttgga Mataagtan *taataaata aaataaagec aauottcaaa 
3001 gaaottgaag otttgtaggt gagatgcaar. aagoeetgdt tttgoataat goaatcaaaa 
3061 atatgtgttt ttaagattag ttgaaUtaa gaaaatgctt gacaaatatt ttcatgtatt 
3121 ttacacaaat gtgatttttg taatatgtct caaeeagatt tattttaaae gcttettatg 
3181 tagagttttt atgoctttot ctcctagtga gtgtgctgae tttttaacat ggtattatca 
3241 actgggccag gaggtagttt otcatgacgg cttttgte*g tatggctttt agtactgaag 
3301 ecaaatgaaa cteaaaacca tctotcttec agctgcttea gggaggtagt ttoaaaggee 
3361 acatacctct ctgagaotgg eagatogcto aetgttgtga ateaecaaag gagotatgga 
3421 gagaattaaa actcaagatt actgttaaot gtgcgttaa* taageaaata aaoagtggot 
3481 oata«aaata aaagtcgcat tccatatctt tggatgggee ttttagaaae ctoattggoo 
3541 agctoataaa atggaa V caa ttg tcatgt tggooaaaea tggtgoaocg agtgatttoo 
3601 atctctggta aagttacact tttattt ct glatgttgta eaatoaaaae acaetactao 
3661 ctattaagtc ccagtatacc tcatttttoa ta tgaaaaa aaaagcttgt ggceaatgga 
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Fig. 7-2 

3721 acagtaagaa uatuataaaa tttttatata tatagtttat ttttgtggga gataaatttt 

3781 ataggaotgt tatttgatgt tgttggtcgo agctacataa gactggacat ttaacttttc 

3A41 taoaatttct gcaagttaw tatgtttgca ygagaaaagt ateaagacgt ttaaetgcag 

3901 ttgactttct cactgttcct ttgagtgtct tctaacttta ttctttgtto tttatgtaga 

3961 attgctgtct atgattgtac tttgaatcyc ttgettgttg aaaatattto tctagtgtat 

4021 tdtcactgtc tgttctgcac aataaacata acagcctctg tgatcoo 
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Fig. 8 
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